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METHOD AND APPARATUS FOR THE PRODUCTION OF 
ANTIGENS AND USES THEREOF 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application claims priority under 35 U.S.C. § 119(e) of provisional U.S. 
Serial No. 60/256,410, filed December 18, 2000, entitled "HLA PRODUCTION FROM 
GENOMIC DNA/'and provisional U.S. Serial No. 60/256,409, filed December 18, 2000, 
entitled W HLA PROTEIN PRODUCTION FROM CDNA/' the contents of which are hereby 
expressly incorporated in their entirety by this reference. 

This application is also a continuation-in-part of U.S. Serial No. 09/465,321, filed 
December 17, 1999, entitled "METHOD AND APPARATUS FOR THE PRODUCTION OF 
SOLUBLE MHC ANTIGENS/' the contents of which are hereby expressly incorporated 
in their entirety by this reference. 

In addition, this application is a continuation-in-part of U.S. Serial No. 
09/974,366, filed October 10, 2001, entitled "COMPARATIVE LIGAND MAPPING FROM 
MHC POSITIVE CELLS/' the contents of which are hereby expressly incorporated in 
their entirety by this reference. 

STATEMENT REGARDING FEDERALLY FUNDED RESEARCH 

At least a portion of the invention was developed under funding from the 
National Institute of Health ("NIH") under contract Nos. Nol-Al-45243 and Nol-Al- 
95360. As such, the U.S. Government may own certain rights in and to this 
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application. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The field of the invention relates in general to at least one method and 
apparatus for the production of soluble MHC antigens and more particularly, but 
not by way of limitation, to at least one method and apparatus for the 
production of soluble Class I and II HLA molecules (i.e. sHLA). The field of the 
invention also includes such produced soluble Class I and II HLA molecules and 
their uses. According to the methodology of the present invention, the soluble 
Class I and II HLA molecules can be produced from either gDNA or cDNA 
starting material. One such exemplary, but non-limiting, use is the formation 
of a tetramericsHLA (or other multimeric complex) complex which may be used 
to test immunogenicity of peptide ligands of interest - i.e. will a peptide ligand 
of interest provoke a CTL response and/or preferentially bind a CTL 

2. Brief Description of the Background Art 

Class I HLA molecules are polymorphic human glycoproteins that 
endogenously bind and then extracellularly present peptide ligands to CD8 + T 
lymphocytes. Polymorphisms within the class I peptide binding groove are 
positioned to moderate ligand binding and presentation to such immune system 
cells. To date, the small quantities of natural ligands available to those of skill 
in the art has limited the understanding of precisely how polymorphism alters 
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peptide binding and in turn, vaccine development and, more basically, if a 
peptide ligand of interest will provoke a CTL mediated immune response. In 
order to address the impact of polymorphism upon antigen presentation, the 
inventors developed the novel approach disclosed herein - i.e. that ligand 
presentation overlaps exist across the polymorphisms and that these overlaps 
distinguish divergent class I peptide binding grooves. Utilizing this novel 
approach and coupling it with a unique hollow-fiber cell culture scheme and 
utilizing a mass spectrometric ligand mapping approach, large quantities of 
peptides eluted from soluble class I and class II molecules can be obtained for 
detailed analyses, vaccine development, and functional testing. 

Initially, peptide ligands were extracted from five different HLA-B15 
allotypes and subsequently examined. Mapping and characterizing the ligands 
obtained from these allotypes demonstrated that they: (I) vary in length from 
7 to 12 residues; (ii) are more conserved at their C termini than at their N- 
proximal residues; and (iii) are presented as overlaps contingent on C-terminal 
preferences. These results provide insight into class I and class II ligand 
loading not available via other methods, demonstrating that an elemental role 
is played by a peptide ligand's C terminus during endogenous binding and 
provides the starting material for a multimeric complex to be used to test the 
functionality of a peptide ligand of interest. The data obtained, and disclosed 
herein, validates, and illustrates the unique methods disclosed herein for the 
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production of sHLA from either gDNA or DNA starting material and the uses to 
which this sHLA material may be put. 

Class I and class II MHC molecules, designated HLA class I and class II, 
respectively, in humans, bind and display peptide antigens upon the cell 
surface. The peptides they present are derived from either normal endogenous 
proteins ("self") or foreign proteins ("nonself"). Nonself proteins include items 
such as the products of malignant transformation or intracellular pathogens. 
In this manner, class I and class II molecules convey information regarding the 
internal fitness of a cell to CD8 + CTLs which are activated upon interaction with 
"nonself" peptides. Such activation may lead the CD8+ CTLs to kill and/or 
suppress a cell which is malignant or contains intracellular pathogens. 

Examination of HLA by serologic and molecular methods by those of 
ordinary skill in the art continues to demonstrate that the class I HLA- A, B, and 
C molecules are encoded by the most polymorphic genes in mammals. 
Translating class I polymorphism into the tertiary structure of the class I 
molecule indicates that residues positioned to affect class I peptide presentation 
to T lymphocytes are most frequently affected by the mutagenic events which 
diversify class I loci. Throughout the world, HLA class I molecules exhibit a high 
degree of polymorphism that is generated by systematic recombinatorial events 
and collectively allows for the presentation of a vast array of different peptides. 
Depending upon allelic composition, two individuals' molecules may not 
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necessarily bind the same peptides with equal affinity or even at all. 

While the general structure and function of MHC class I molecules has 
been reasonably well studied and established, their polymorphic nature and 
how they specifically influence the capacity of class I in peptide binding and 
presentation remains an issue of persistent inquiry by those of ordinary skill in 
the art. The nature of precise overlaps in peptide binding specificity to HLA 
class I is particularly ill-defined at the current time due to the complexity of 
peptides bound. For example, this and other issues must be clarified in order 
to effectively pursue vaccines capable of eliciting protective CTL responses 
across an extensive population range. Unraveling the functional significance of 
class I polymorphism is an important issue that requires an understanding of 
how the mutagenic events diversifying the class I binding groove differentially 
moderate the presentation of peptide ligands. 

The heavy chains of class I molecules are encoded within the MHC and, 
upon assembling into heterodimers with the light chain, p 2 m, are responsible 
for selectively gathering endogenously processed peptides. Once peptides are 
collected, mature class I molecules transport the bound peptides to the cell 
surface where receptors on CD8 + T lymphocytes engage the class I molecules 
to inspect the ligands. CTLs may then be triggered by class I molecules bearing 
virus or tumor-derived peptides. 

With respect to the background art, hereafter numerous references are 
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disclosed which detail one or more aspects of the background art as it relates 
to the novel methods and uses of the present invention. As such, each 
reference listed should be understood as being wholly incorporated by reference 
herein in their entirety as though the reference were fully transcribed herein. 
In this manner, one of skill in the art given the present specification would be 
fully informed and could truly appreciate the novel and unprecedented nature 
of the invention(s) disclosed and taught herein. The full and complete citations 
for each reference are appended hereto after the detailed description and 
before the claims. 

The class I molecules expressed upon the nucleated cells of all vertebrate 
systems studied to date (for example, Ennis et al. 1988; Winkler et al. 1989; 
Kaufman etal. 1990; Lawloretal. 1990; Trowsdale 1995; Prilliman et al. 1996; 
Antao et al. 1999) are heterodimers composed of a glycosylated 45 kDa heavy 
chain (a-chain) and a 12 kDa light chain (p 2 m). In humans, heavy chains are 
encoded at 3 loci (B, C, and A) within the MHC on the short arm of chromosome 
6 (FIG. 1A). FIG. IB illustrates each a-chain comprised of a u oc 2/ and cc 3 
domains, as well as a transmembrane domain, which tethers the molecule to 
the cell surface and a short C-terminal cytoplasmic domain (Bjorkman and class 
I location and heavy chain coding region). 

X-ray crystallography (Bjorkman etal. 1987a; Madden etal. 1991; Saper 
et al. 1991; Madden etal. 1992; Collins etal. 1995; Reid et al. 1996; Smith et 
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al. 1996a; Smith etal. 1996b; Glitheroetal. 1999) has illustrated details of the 
structural relationship of the extracellular oc-chain domains and p 2 m (FIG. 2). 
The membrane-proximal domains, oc 3 and P 2 m, associate in an immunoglobulin 
fold structure. The membrane-distal a x and cr 2 domains together create a 
closed basket-like structure that sits atop the a 3 and p 2 m structure (FIG. 2A). 
It consists of two a-helical "walls" with a "floor" created by eight anti-parallel 
p-sheets (Bjorkman et al. 1987a; Madden 1995; and FIG. 2, B and C). In the 
earliest studies, detection of electron density situated in the a A /a 2 groove 
helped to clarify the experimentally-suspected occupancy of peptide fragments 
8-10 residues long and thus the function of class I molecules in presenting such 
peptides upon the cell surface (Bjorkman et al. 1987b). Initial crystallographic 
studies designated six subsites, A-F (FIG. 3), or "specificity determining 
pockets," that constitute the peptide binding groove (Garrett et al. 1989; Saper 
et al. 1991; Fremont et al. 1992; Matsumura et al. 1992; Elliott et al. 1993; 
Young et al. 1995). In addition to crystal structure analyses, thermodynamic 
stability studies indicate that networks of hydrogen bonds to structural residues 
lining the A- and F-pockets, which lie at opposite ends of the groove, serve to 
fasten a peptide by its N and C termini, respectively (Bouvier and Wiley 1994); 
these two pockets are thus implicated in the fixed orientation of a peptide 
within the binding groove. 

Class I molecules primarily associate with peptide fragments, thus 
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forming a-chain/p 2 m/peptide trimolecular complexes, via an endogenous 
processing pathway during their assembly (Germain 1994; Heemelsand Ploegh 
1995; Lehner and Cresswell 1996; York and Rock 1996; Pamer and Cresswell 
1998); in fact, the very binding of peptides is essential for the stabilization and 
expression of these molecules (Ljunggren et al. 1990; Townsend et al. 1990; 
Elliott 1991). The class I cc-chain and p 2 m are cotranslationally translocated 
into the ER lumen (Townsend et aL 1990; Germain and Margulies 1993; 
Neefjes et al. 1993), where the cc-chain remains anchored to the ER 
membrane and is stabilized prior to association with p 2 m and/or peptide 
through interactions with various chaperone proteins, including BiP, ER-60, 
calnexin, calreticulin, and tapasin (NoSner and Parham 1995; Sadasivan et al. 
1996; Suh etal. 1996; Spee and Neefjes 1997; Harris et al. 1998; Lindquist et 
al. 1998). Although several alternative proteolytic processing and transport 
pathways, certainly exist (for example, Hsu etal. 1991; Henderson etal. 1992; 
Kozlowski et al. 1992; Roelse et aL 1994; Snyder et al. 1994; Ferris et al. 
1996; Craiu et al. 1997; Luckey et a!. 1998; Mosse et al. 1998; Wang et al. 
1998; Young et al. 1998), it Is believed that the majority of peptides the 
nascent class I molecules interact with are delivered to the ER in a distinct 
series of events. 

Proteins in the cytoplasmic compartment are first enzymatically degraded 
into peptides of relatively uniform length by an ATP-dependent proteasome 
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complex (Coux et al. 1996). Some proteasome components include the IFN-y 
inducible subunits LMP2 and LMP7; these are themselves encoded within the 
MHC (Gaczynska et al. 1994). The typically nonameric fragments produced are 
then actively conveyed across the ER membrane via a dimer of TAP1/TAP2, an 
MHC-encoded ATP-binding cassette transporter (Monaco et al. 1990; Parham 
1990; Grandea III et al. 1995). Once inside the ER, a processed peptide can 
be captured within the aja 2 groove of a class I molecule and a stable trimer 
is formed (Germain and Margulies 1993; Smith et al. 1995). This trimeric cc- 
chain/0 2 m/peptide complex is then transported through the Golgl complex and 
ultimately to the extracellular surface. The processes of class I assembly and 
transport are summarized in FIG. 4. 

Following cell surface localization, mature complexes of class I bearing 
peptide antigens become available for interaction with receptors on monocytes, 
B and T lymphocytes, and NK cells (Townsend and Bodmer 1989; Yokoyama 
1993; Lanier and Phillips 1996; Borges etal. 1997; Cosman etal. 1997). Their 
primary, and to date most thoroughly examined, natural receptor appears to 
be the TCR of T lymphocytes bearing the CD8 heterodimer. Site-directed 
mutagenesis and crystallographic studies indicate that the Voc and V p domains 
of heterodimeric TCRs associate in a diagonal fashion across the top surface of 
the structure formed by MHC 0^ and cc 2 (Hogan et al. 1988; Lombardi et al. 
1991; Moots 1993; Tussey et al. 1994; Garboczi et al. 1996; Garcia et al. 
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1996; Parham 1996; Bjorkman 1997; Smith and Lautz 1997; Manning et al. 
1998). The hypervariable CDRs of V a /V p contact specific regions of this 
interface (FIG. 5). Both precursor and effector CTLs are defined as being class 
I-restricted in that they are only capable of recognizing and responding to 
antigens displayed in the context of these molecules (Zinkernagel and Doherty 
1974). 

Since the antigens presented to CD8 + T lymphocytes are predominantly 
obtained through the processing of intracellular proteins as previously 
described, class I molecules figuratively serve as external banners that 
advertise the inner contents of the cells. Indeed, these antigens of peptide 
ligands indicate to the Immune system as a whole which cells are to be 
eliminated and/or protected. Malignancies and/or pathogens effectively use this 
system to camouflage their existence and thereby escape detection and 
elimination by CD8+ CTLs, for example. 

Thymic education of lymphocytes prevents activation in response to 
characteristic cell-derived peptides (Robey and Fowlkes 1994), but peptides 
acquired through the degradation of atypical proteins are recognized and induce 
cytolysis (Townsend et al. 1985; Gotch et al. 1988; Walker et al. 1988; Clark 
et al. 1995). Therefore, it is not surprising that CD8 + T lymphocytes play a 
critical role in controlling and/or eliminating infected and neoplastic cells. 

CD8+T lymphocytes are implicated in immunity to pathogens such as 
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viruses, which are intracellular invaders that utilize the host cell's biosynthetic 
machinery to produce their own foreign proteins (Yap et al. 1978; Lin and 
Askonas 1981; Jamieson et al. 1987; Harty and Bevan 1992; Riddell et al. 
1992; Kulkarni et al. 1995; Heslop et al. 1996; Schmitz et al. 1999). CTL 
responses are likewise extended to include stimulation by aberrant proteins 
such as those associated with malignancy (Vose and Bonnard 1982; Muul et al. 
1987; Coulie et al. 1992; Melief 1992; Kittlesen et al. 1998; Shichijo et al. 
1998). In fact, class I molecules are capable of binding and presenting to CTLs 
any protein introduced into the endogenous processing pathway by either 
natural or artificial means (Gooding and O'Connell 1983; Moore et al. 1988; 
Yewdell and Bennink 1990; Bertoletti et al. 1991; Donnelly et al. 1993; 
Ikonomidis etaL 1994; Ballard etal. 1996; Dayetal. 1997; Goletzetal. 1997; 
Kim et al. 1997). This knowledge serves as a motivating factor behind the 
development of both protein/peptide-based vaccines and other therapeutics 
intended to elicit protective CTL responses to microbial pathogens and other 
abnormalities, which otherwise remain cytoplasmically concealed from 
detection. 

Fully understanding the role of class I molecules in ligand presentation as 
described above is complicated by a-chain polymorphism. Class I structural 
differences resulting from genetic variation confer extreme heterogeneity upon 
regions of the molecule that interact with peptides. The knowledge of how 
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polymorphism specifically impacts the natural presentation of peptide epitopes 
upon the cell surface is consequently limited. 

Class I MHC polymorphism was first documented in mice (Gorer 1936; 
Gorer 1937; Nathenson et al. 1981) and next studied in humans by serology 
(Dausset 1958; Payne and Hackel 1961; van Rood 1969); however, first protein 
and then DNA sequencing studies precisely demonstrated this genetic variability 
to be most concentrated throughout the exons of the and oc 2 heavy chain 
domains at positions affecting amino acid residues that line the walls and floor 
of the previously described peptide binding groove (Orr et al. 1979; Tragardh 
etal. 1979; Rojosetal. 1987; Parhametal. 1988; Parham etal. 1989; Parham 
, et al. 1995). It is the more centrally-located binding pockets (B-E), together 
with specific residues within the F-pocket, that appear to be the most 
polymorphic (Chelvanayagam 1996; Kostyu et al. 1997). Changes in the 
physicochemical properties of amino acid side chains within the groove can 
influence the stability with which given peptides interact during the assembly 
of a-chain/p 2 m/peptide trimers (Matsui et al. 1993; Rohren et al. 1993; Salter 
1994; Young et al. 1995). Therefore, despite the overall structural 
conservation illustrated among class I a-chains (Bjorkman and Parham 1990; 
Madden 1995), their peptide binding grooves can vary drastically from one 
allelic form to another; as a result various isoforms are capable of associating 
with distinct arrays of peptides (Elliott et al. 1993; Smith et al. 1995; Smith et 



-12- 



WO 02/062846 PCT/USO 1/49744 

al. 1996b). 

The characteristic polymorphism observed among class I molecules is 
thought to originate primarily through recombination and gene conversion 
(Kuhner et al. 1991; Parham et al. 1995); point mutations are believed to 
contribute more rarely to the pool of new alleles continually arising (Parham et 
al. 1989). Individuals inherit a set of three class I genes from each parent, and 
since their expression is codominant, a single person may therefore display up 
to six different HLA class I molecules upon his or her nucleated cells. From 
these alleles, new forms evolving progressively within populations can be 
passed on to subsequent generations and likewise serve as templates upon 
which yet further diversity may be introduced. This occurs through events such 
as single or double recombination (Parham et al. 1988) or nonreciprocal 
exchanges between c/s-oriented gene segments during gene conversion 
(Parham 1992). Serological cross- reactivity studies, as well as locus-specific 
PCR amplification and sequencing analyses, have verified the existence of allelic 
subtypes, or allotypes, of closely related alleles that appear to have arisen from 
a common ancestral template by these molecular mechanisms (for example, 
Payne et al. 1978; Ooba et al. 1989; Hildebrand et al. 1994; Prilliman et al. 
1996). While both inter- and intra-locus genetic events may give rise to 
polymorphism, the latter is most commonly observed; alleles at a locus 
generally tend to be more closely related to one another than to those present 
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at other loci (Parham et al. 1988; Parham et al. 1995). The forces driving HLA 
class I polymorphism are believed to be those of overdominant or balancing 
selection (Hughes and Nei 1988; Hughes and Yeager 1998); this is based upon 
values of d N >d s within the coding regions of a x and ot 2 for specificity 
determining pocket residues positioned to interact with the peptide binding 
groove, while the contrary (d s >d N ) is observed among the remaining a x /a 2 
residues. 

Considering the manner by which class I structural polymorphisms are 
generated and maintained demonstrates that HLA genetic variability affords 
both advantages and disadvantages. It is beneficial in ensuring that at least a 
small portion of the human population will possess class I molecules capable of: 
(i) binding immunogenic peptides derived from any given pathogen; (ii) 
presenting those peptides to CTLs; and (iii) evoking a protective imune 
response. In short, annihilation of the species is guarded against by molecular 
diversity (Parham 1992). The concept of heterozygote advantage through 
polymorphism as a mechanism for effectively allowing broader peptide binding 
abilities and thus broader CTL recognition of pathogenic peptides has been 
strongly emphasized from a statistical perspective (Hughes and Nei 1988; Nei 
and Hughes 1992). For example, HLA heterozygosity has been correlated with 
diminished progression to disease following HIV infection (Carrington et al. 
1999). In addition, at the level of individual allotypes the "nonrandomness" 
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with which certain polymorphisms are maintained within populations following 
their evolution supports positive natural selection. This might occur in response 
to specific pathogenic pressures, as seen in the strong association of the West 
African allele HLA-B*5301 with resistance to malaria (Hill et al. 1991; Hill et al. 
1992), a parasitic illness endemic to West Africa. As mentioned previously 
however, the polymorphic nature of class I molecules results in divergent 
allotypes binding and presenting different peptides. CTLs thus focus on distinct 
portions of any given pathogen from one individual to another. Therefore, 
dissecting disease susceptibilities and resistances requires a grasp of how 
binding groove- localized amino acid variations specifically alter ligand 
presentation. 

Numerous previous research endeavors have been directed toward 
understanding the structural and functional nature of peptides bound by HLA 
complexes; though some progress has been made in analyzing the manner that 
peptide binding is specifically influenced through a 1 /a 2 substitutions, this 
knowledge remains limited and sometimes inconsistent. The full extent that 
polymorphisms dictate the degrees of ligand binding ability, stringency, and/or 
degeneracy (and subsequently cell surface presentation) has, as a result, not 
been adequately resolved. Similarly, the occurrence of overlapping ligands, or 
identical peptides presented across the binding groove polymorphisms of 
multiple distinct allotypes, remains to be explored. 
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Ideally, characterizing functional overlaps would provide an advantage not 
only to explore the general effects of binding groove architecture but more 
specifically to understand the similarities and/or differences of what is 
presented to CTLs by the class I molecules of genetically diverse individuals. 
In the search for answers to ligand binding influences by a-chain 
polymorphisms, methods including pooled Edman sequencing, mass 
spectrometry analysis, and binding/reconstitution assays have been employed. 
However, each approach bears its own strengths and limitations and none so 
far has been significantly successful in comparatively evaluating levels of 
functional overlap across class I polymorphisms. The importance of 
understanding peptide associations with polymorphic class I molecules at a level 
of complexity not necessarily afforded by the currently-defined strategies is 
thus underscored: epitope predictions based upon methods that fail to 
accurately assign possibilities for natural binding groove occupancy by either 
aberrant or low-abundance peptides of varying binding affinities interfered with 
detecting presentation overlaps among various HLA class I allotypes. 

Early investigations of class I peptide ligands focused on simplifying the 
effects of polymorphism through establishing "motifs" (Rotzschke and Falk 
1991; Rammensee et al. 1993; Engelhard 1994). Motifs have typically been 
established by purifying surface-expressed class I molecules from detergent 
lysates of either transformed cells or transfectants and extracting the bound 
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ligands with either TFA or acetic acid. The resulting peptide pools are then 
subjected to consecutive cycles of N-terminal Edman degradation (Falk et al. 
1991; Jardetzky et al. 1991). 

The resultant motifs from N-Terminal Edman degradation are invariably 
nine amino acids in length and describe, as based upon relative yield increases 
per cycle, conserved "anchor" residues, or sites of stereochemical preferences, 
for peptides that are bound by a class I molecule. These anchors, typically 
appearing to involve both P2 and P9 of the ligands, are considered to be allele- 
specific and thus common among nearly all of the peptides bound by a given 
class I allotype. The remaining positions demonstrate no overriding amino acid 
preferences, although the motifs of a few molecules demonstrate anchors at P3 
or P5 (Rammensee et al. 1997). The P2 anchor is assumed to associate with 
the B-pocket of the binding groove, while P9, the Oterminal residue 
assignment, associates with the F-pocket. "Auxiliary" or "secondary" anchors 
(alternative positions frequently defined by the occupancy of chemically similar 
residues) are additionally included in the motifs of some class I molecules 
(Rammensee et al. 1997). In general, a common interpretation of this type of 
data Is that endogenous peptide binding and/or loading requires a nonamer 
with particular P2 and P9 anchors. As will be discussed, many searches (and 
consequent failed attempts) for putative viral or tumor class represented 
epitopes have subsequently been predicated upon nonameric templates with P2 
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and P9 anchor assignments. 

Broad efforts have been focused upon establishing and analyzing motifs 
from natural class I ligands (for example, Huczko et al. 1993; Fleischhauer et 
al. 1994; Kubo et al. 1994; Barber et al. 1995; Steinle et al. 1995; Barber et 
al. 1996; Tzeng et al. 1996; Tieng et al. 1997; Yague et al. 1998). However, 
motifs fail to reflect the true complexity of peptides presented by divergent 
class I molecules. Drawbacks are evident in that numerous characterized 
peptide sequences that bind class I are greater than 9 residues long, with 14 
being the largest identified to date (Engelhard 1994); The binding of longer 
ligands likely results from stable associations at anchor positions coupled with 
central protrusion of the peptide outward from the groove (Fremont etal. 1992; 
Guo et al. 1992; Madden et al. 1993). Furthermore, other examinations of 
specific peptides naturally presented by class I MHC of both humans and mice 
have indicated that some fail to comply with their respectively defined motif 
anchors (Calin-Laurens etal. 1993; Henderson etal. 1993; Sadovnikova etal. 
1993; Kawakami et al. 1994; Urban et al. 1994; Malarkannan et al. 1996; Mata 
et al. 1998), thus suggesting that both length variance and "nonanchor" 
residues in the peptide could play significantly more prominent roles in binding 
than strictly accounted for by a given pooled motif alone. Studies have also 
indicated that low copy-number peptides, presented by only a small proportion 
of the total class I molecules expressed upon the cell surface, can successfully 
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elicit CTL responses (Cox et al. 1994; Malarkannan et al. 1996; Wang et at. 
1997). Issues such as these clearly reflect the limitations posed in applying 
Edman sequencing to the complex mixtures of peptides extracted from class I 
molecules (Stevanovicand Jung 1993). Asa result, pooled Edman sequencing 
is therefore unable either to precisely characterize individual ligands or to 
effectively identify overlaps in ligand presentation. 

Shortly after the debut of examining class I ligands by pooled Edman 
sequencing, the first reports of ligand analyses via mass spectrometric 
sequencing of organic ions from similarly-prepared class I ligand extracts began 
to emerge (Henderson et al. 1992; Hunt et al. 1992; Huczko et al. 1993; 
Appella et al. 1995). The utilization of MS/MS on a triple quadrupole mass 
spectrometer provided for the precise characterization of individual constituents 
at sub-picomolar levels from pools of class I peptides, in contrast to the 
picomolar detection limits of Edman analysis. Furthermore, LC/MS prior to this 
step allowed for complexity estimates to be made. For example, based upon 
the quantitation of 200 different peptides present within HLA-A*0201 extracts, 
extrapolation with regard to the peptides detected versus their respective 
contributions to the mass spectrometric TIC obtained indicated that at least 
1,000 and perhaps as many as 10,000 unique peptides are bound by this class 
I allotype (Hunt et al. 1992). The ability to characterize ligands as such 
additionally assisted in being able to identify and sequence single specific 
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epitopes from RP-HPLC fractions demonstrating biological activity via CTL 
assays (Rotzschke et al. 1990; Henderson et al. 1993; Kawakami et al. 1994; 
Skipper et al. 1996; Simmons et al. 1997; Wang et al. 1997; Hogan et al. 
1998; Paradela et al. 1998). Examination of ligands by mass spectrometry was 
therefore an effective development in both starting to fill the gaps often present 
in pooled motifs and expediting the classification of ligands potentially bearing 
immunological significance. 

However, the routine application of mass spectrometric techniques to 
class I ligand examination has remained relatively isolated; it is practiced in 
only a handful of laboratories (for example, Woods et al. 1995; Simmons et al. 
1997; Flad et al. 1998; van der Heeft et al. 1998; Yague et al. 1998). This 
appears largely due to the inherent difficulties imposed in handling the small 
quantities of peptides extracted for study (Henderson et al. 1993; Appella et al. 
1995; van der Heeft et al. 1998) as well as the notably tedious nature of the 
subsequent data processing (Papayannopoulos 1995; van der Heeft et al. 
1998). Other issues concern the specific instrumentation and its mode of 
operation, since: (i) the ability to consistently identify peptides from particular 
extracts relies upon dependable RP-HPLC gradients for separation; and (ii) the 
possible ionization and/or detection interfaces function in distinct manners that 
can influence the data ultimately acquired (Chapman 1996; Watson 1997). In 
summary, understanding the impact of polymorphism upon the binding of 
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endogenous peptdes has historically been limited by small amounts of ligands 
available for analyses. 

What has not been accomplished yet by mass spectrometry is the 
systematic definition of peptides across diverse class I allotypes. Various in 
vitro assays have been developed to complement mass spectrometric 
approaches. The assays are performed using a number of different protocols 
with the common theme of assessing the relative abilities in vitro of 
synthetically defined peptides to associate with specific class I cx-chains and 
3 2 m. In the case of binding assays, synthetic peptides as well as a peptide 
standard are incubated with purified class I complexes and tested in their 
capacities to competitively displace the ligands already bound (Chen and 
Parham 1989; Ruppert et ah 1993; Sette et al. 1994). This has likewise been 
performed by stripping ligands from class I complexes expressed on the cell 
surface by acid treatment and then incubating the cells with the synthetic 
peptides (Drijfhout et al. 1995). Another common approach to competitive 
binding assays involves FACS analysis with class I-specific MAbs after incubation 
of synthetic peptides with various cell lines including RMA-S (Townsend et al. 
1989), T2 (Salter et al. 1985), Hmy2.ClR (Storkus et al. 1989), or 721.221 
(Kavathas et al. 1980) transfected to express HLA a-chains (Huczko et al. 
1993; Takamiya et al. 1994; Zeh III et al. 1994; Boisgerault et al. 1996; Konya 
et al. 1997). Alternatively, de novo reconstitution assays can be performed by 
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incubating synthetic peptides with free a-chains and p 2 m and comparing the 
resultant quantities of free versus complexed a-chains (Tanigaki 1992; Fruci 
et al. 1993). Other methods of reconstitution have also been applied (Silver et 
al. 1991; Parker et al. 1994; Gnjatic et al. 1995; Robbins et al. 1995; Tan et 
al. 1997). 

While these in vitro assays with synthetic ligands have led to the 
development of extensive "supermotifs", or enhanced motifs that ascribe 
binding preferences to particular class I allotypes or groups of related allotypes 
(del Guercio et al. 1995; Sidney et aL 1996a; Sidney et al. 1996b), they 
present obvious limitations. A technical concern is that binding/reconstitution 
determinations are often based upon either "standard" probe peptides for each 
allotype examined or mean experimental values that can significantly differ 
from one laboratory or experiment to another. A specific illustration is based 
on the results generated by three different groups testing HLA-A*0201 with 
panels of synthetic peptides; the assignment of residues preferred or 
deleterious for binding at given positions varied widely (Ruppert et al. 1993; 
Parker etal. 1994; Drijfhout etal. 1995). In fact, only three residue similarities 
carried across all three studies. 

Ruppert and colleagues and Drijfout and colleagues each employed 
competitive binding assays; however, the former involved assessing the ability 
of synthetic peptides to inhibit binding of a radfoiodinated standard peptide to 
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membrane-extracted class I complexes, while the latter involved stripping 
peptides from class I complexes on B-LCLs and determining by FACS analysis 
the ability of synthetic peptides to inhibit binding of a different fluorescence- 
labelled standard. On the other hand, Parker and colleagues employed a 
reconstitution assay which measured the incorporation of radioiodinated p 2 m 
into complexes refolded using synthetic peptides and a-chains prepared from 
Escherichia coli inclusion bodies. Of perhaps greater significance, these types 
of assays fail to account for the processing/loading physiology of trimolecular 
complex formation within the cell (Hogan et ai. 1988); indeed, differences have 
been documented in performing comparative examinations by pooled Edman 
sequencing and mass spectrometry upon naturally-extracted versus artificially 
bound peptides (Davenport et al. 1997). In one specific case, an 
immunodominant HIV peptide divulged through other mechanisms completely 
failed to demonstrate binding to its restricting allotype via an in vitro assay 
(Tsomides et al. 1991). Conclusions regarding Hgand presentation that are 
drawn from bindlng/reconstitution assays thus cannot be applied generally 
without caveat. 

Thus, the present invention(s) aim to provide a methodology for the 
production of soluble MHC Class I and II molecules from either gDNA or cDNA 
starting material such that the structural and functional impact of HLA class I 
polymorphism on peptide binding can be assessed and, In particular, to test 
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how natural ligand presentation overlaps exist in varying degrees across the 
polymorphisms of divergent class I binding grooves. Furthermore, the soluble 
MHC molecules can be used in the functional testing of CTL binding assays and 
vaccine deveopment. Utilizing this specification, one of ordinary skill in the art 
will able to: (i) generate ligands and hence ligand maps from the peptide pools 
extracted from series of distinct yet related class I HLA-B15 allotypes; (ii) 
compare the different ligand maps to identify potentially shared elements; and 
(iii) characterize the elements identified to positively or negatively validate the 
occurrence of overlapping ligands. One of ordinary skill in the art, given the 
present specification, will also realize that the ability to produce soluble MHC 
molecules from either gDNA or cDNA starting material also will allow for other 
useful assay and vaccine development such as the functional testing of peptide 
ligands of interest to determine if, when, and how such peptide ligands provoke 
and/or stimulate an immune response. All of which is directed toward the goal 
of identifying candidate peptide ligands that may be used singly or together as 
a vaccine and/or immune system primer. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

FIG. 1 is a graphical representation of MHC class I location and heavy 
chain coding region. (A) Simple map of the human MHC region, specifically 
highlighting the B, C, and A loci that encode the class I heavy chains as 
simplfied from (Janeway and Travers 1994). Genetic distances are estimated 
in kilobases. Other genes (not shown) including heavy chains and 
transporter/chaperone proteins (class II) and complement proteins and 
cytokines (class III) are encoded within the remaining MHC regions. (B) The 
basic exon structure of MHC class I transcripts. A total of seven exons encode 
the leader peptide (1), <x x domain (2), a 2 domain (3), cc 3 domain (4), 
transmembrane domain (5), and cytoplasmic domains (6-7). Specific exon 
regions are indicated in parentheses for HLA-B*15011 (accession number 
U03859, Hildebrand et al. 1994), which encodes the HLA-B*1501 heavy chain. 

FIG. 2 is a three-dimensional graphical representation of the configuration 
of the MHC class I extracellular domains. (A) The complete extracellular portion 
of the class I molecule HLA-B*3501, revealing the three domains (a lf a 2 , and 
a 3 ) of the heavy chain as well as the noncovalently associated light chain, J3 2 m. 
Top (B) and side (C) views of the basket-like antigen binding groove formed by 
cx x and oc 2 ; the a-helix and (3-sheets for each domain are respectively indicated 
in (B). These three images were rendered from the HLA-B*3501 Protein Data 
Bank (Brookhaven National Laboratory) crystal structure coordinate file, IAIN 
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(Smith et al. 1996a), using the Swiss-PdbViewer 3.01 software (Guex and 
Peitsch 1996). The CHO linked to a 1 residue 86 is not shown. 

FIG. 3 is a graphical representation of the specificity determining pockets 
of the MHC class I ligand binding groove. The structural residues contributing 
to the six pockets (A through F) of the antigen binding groove formed by a t and 
ot 2 are indicated in black and numbered from the N terminus of the mature class 
I cc-chain. The residues are as collectively defined in the literature (Saper et 
al. 1991; Matsumura et al. 1992; Chelvanayagam 1996). In terms of a 
nonameric ligand, the pockets are assumed to accommodate the amino acids 
occuring at given ligand positions as follows: A-pocket, PI; B-pocket, P2; C- 
pocket, P6; D-pocket, P3; E-pocket, P7; and F-pocket, P9. The majority of 
residues that contribute to these binding pockets are oriented such that they 
are solvent-inaccessible in mature trimers. 

FIG. 4 is a pictorial representation summerizing MHC class I trimer 
assembly and transport. The typical endogenous processing pathway shown is 
simplified from the description disclosed herein. Major proteins participating in 
the processes are specifically illustrated and labelled. Some alternative 
processing and/or transport mechanisms currently under investigation are 
designated in the diagram by question marks. Assembled trimers travel 
through the Golgi complex prior to vesicular delivery to the surface membrane 
of the cell (not shown). 
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FIG. 5 is a schematic representation of the deduced contact region 
between the MHC class I a x /a z and TCR V a /V p interfaces. From the diagonal 
orientation of the estimated binding footprints of CDRs 1, 2, and 3 (numbered) 
for each of TCR-V a and V p with respect to the top of MHOO^ and oc 2 , the 
general interaction between TCR and MHC appears to predominantly involve 
V a /V p contact with both the class I solvent-accessible helical residues and the 
central portions of the bound ligand (not shown). 

FIG. 6 is a flow diagram of the overall strategy for comparatively mapping 
and characterizing peptide ligands presented by class I HLA. The approach 
taken to address the presence of overlapping ligands across diverse class I 
molecules consisted of three basic parts: (A) sHLA-producing transfectant 
establishment and culture; (B) extraction, purification, and separation of 
ligands; and (C) ion map generation/comparison and characterization of 
individual peptides. Though only two molecules are indicated for simplification, 
numerous additional molecules (as indicated in the specification) were 
simultaneously carried through the steps shown. 

FIG. 7. Hypothesized evolutionary relationships of HLA-B15 allotypes 
according to expressed a-chain polymorphisms. Shown in this scheme are the 
46 allotypes whose amino acid sequences are provided in herein. Solid lines 
indicate single mutagenic events separating given allotypes. Dashed lines 
indicate that greater than one mutagenic event separates given allotypes; 
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intermediaries are indicated by question marks unless more specifically 
suspected (as in the case of a probable B*1539/B17 recombinant). None of the 
line lengths are reflective of mutational rates. 

FIG. 8 is a graphical representation of the localization of antigen binding 
groove substitutions distinguishing the 6*1512, 6*1508, 6*1501, 8*1503, 
B*1518, and B*1510 allotypes. The structural residues of the antigen binding 
groove formed by cx 1 and a 2 which differ among the six alleles according to 
Table 1 are indicated on the ribbon diagram in black and numbered from the N 
terminus of the mature class I a-chain. 

FIG. 9 is a graphical representation summarizing of the Unisyn 
Technologies CP-3000 basic flow path. The system was assembled and 
operated as described herein. Arrows indicate unidirectional media flow. 

FIG. 10 is two graphs showing sHLA production during bioreactor runs 
with two different B*1501 transfectants. Constructs using PSR a . neo and pcDNA3 
for producing soluble 8*1501 were separately transfected into B-LCL 721.221; 
subclones were then cultured in CP-3000 systems as described herein. 
Harvests samples drawn on,the days indicated along the x-axis during each run 
were subjected to ELISA, yielding the representative sHLA production data 
shown. 

FIG. 11 is a graph showing RP-HPLC fractionation of peptides extracted 
from B*1510. The UV trace obtained during separation as described herein of 
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approximately 400 pg of peptides extracted from B*1510 reveals the bulk of 
absorbance to occur along the gradient (black line) between 10-40% buffer B 
(indicated at the right). The three peaks associated with the control dye 
(methyl violet base B), as well as the primary region subjected to intensive 
mapping between B*1501, B*1503, 6*1508, and B*1510 peptides (fractions 
6-19), are indicated. 

FIG. 12 is a schematic showing the generalized components of a triple 
quadrupole mass spectrometer. The basic constituents of the system described 
herein Include: (A) an electrospray source/ionization interface for sample 
introduction; (B) three quadrupoles for ion manipulation, which includes mass 
filtration (Ql or Q3), transmission (all quadrupoles), and/or collision (q2); and 
(C) a detector for amplifying transmitted ion signals so that they can be 
recorded and analyzed. 

FIG. 13 includes two graphs (A and B) and chart C that show the 
identification and verification of a ligand overlap across divergent HLA-B15 
molecules. NanoES-MS spectral ion maps obtained individually from RP-HPLC 
fraction 8 for each of B*1501, B*1503, 8*1508, and B*1510 were aligned for 
comparison; an expanded view of the range 495-555 m/z, oramu, is shown in 
(A). The ion mass centered at 517.2 m/z matched across the spectra of 
B*1501, B*1503, and B*1508 (top three panels) but not B*1510 (bottom 
panel). This ion was subsequently selected for NanoES-MS/MS from fraction 8 



-29- 



WO 02/062846 PCTYUSO 1/49744 

of the first three class I MHC molecules. The homologous spectra resulting 
from fragmentation of this [M+2H] 2+ ion (B) classified the peptide as a positive 
match, or ligand overlap, occurring across 8*1501, 8*1503, and B*1508 and 
allowed for primary sequence derivation (C). N- and C-terminal peptide 
fragments present in all three NanoES-MS/MS spectra are labelled according to 
standard nomenclature (Roepstorff and Fohlman 1984) in the top panel of (B) 
and underlined in (C); immonium tons are indicated by their single-letter amino 
acid codes in (B), and the sequences of internal cleavage products are also 
specified. 

FIG. 14 is a graphical representation showing the reproducibility of the 
class I HLA ligand mapping and characterization strategy disclosed herein. 
Approximately 400 pg each of B*1508 ligands obtained from two separate 
bloreactor runs performed six months apart were fractionated by RP-HPLC as 
shown for B*1510 (FIG. 11). NanoES-MS ion mapping and comparison were 
then performed as described herein. As illustrated by the spectra of fraction 15 
for each (A and B), the ion maps were consistent with one another; subjecting 
an ion (514.0) from each B*1508 fraction to NanoES-MS/MS further 
demonstrated reliability of the protocols employed. 

FIG. 15 is a tabulation of the pooled Edman sequencing motifs for 
peptides extracted from B*1501 purified by two different MAbs. Data derived 
from the eluates of B*1501 complexes purified using either W6/32 (A) or BBM.l 
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(B) indicated amino acid residues that had been previously attributed to the 
corresponding positions in B*1501 motifs (solid characters), as well as a 
residue which was not included in any previous motif descriptions (Falk et al. 
1995; Barber et al. 1996). Edman degradation was carried out as described 
hereinafter, and relative picomoJar fold increases are indicated in parentheses 
to the right of each residue; amino acids demonstrating a 2.0 to 3.5-fold 
picomolar increase over the previous degradation cycle were grouped as 
"strong", while those demonstrating an increase of greater than 3.5-fold were 
grouped as "dominant". Dashes indicate that no distinct residues could be 
detected as either dominant or strong at the given positions. 

FIG. 16 is a tabulation and corresponding pictorial representation of 
pooled Edman sequencing motifs for peptides extracted from B*1508, B*1512, 
B*1503, B*1518, and B*1510. Edman degradation was carried out as 
described herein, and the data were analyzed as described hereinbefore with 
respect to FIG. 15. Ribbon diagrams of the class I antigen binding groove to 
the right of each motif show the structural residues of the a x /a 2 antigen binding 
groove among each of the alleles which differ with respect to B*1501 according 
to Table 1; they are indicated in black and numbered from the N terminus of 
the mature class I a-chain. 

FIG. 17 is a graphical showing RP-H PLC separation of ligandsfrom BBM.l- 
purified B*1501. The UV trace obtained during separation as described herein 
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of approximately 150 |jg of peptides extracted from BBM.l-purified B*1501 
reveals the bulk of absorbance to occur along the gradient (black line) between 
10-30% buffer B (indicated at the right). The fractions collected are numbered; 
although all fractions were examined, only those subsequently selected for 
analysis and data presentation in FIGS. 18 and 19 are marked by dots. It is 
noted that the chosen fractions were distributed evenly across the entire region 
of interest and included a wide variety that were of high as well as low UV 
absorbance and/or resolution. 

FIG. 18 is four graphs showing the percentages of RP-HPLC ligand 
fractions from BBM.l-purified B*1501 demonstrating particular amino acid 
occupancies by Edman degradation. Sequence complexity among the peptides 
was summarized by averaging the frequency of occurrence for amino acid 
residues at P2 through up to P12 for the fractions marked by dots in FIG. 17. 
Amino acids are grouped along the x-axis according to their physicochemical 
natures (charged, polar, or hydrophobic), as specifically designated in the chart 
for P10 and indicated within all four charts by differential shading. Residues 
observed at P2 and P9 in former B*1501 motif descriptions are indicated in bold 
italics. Since it was not derivatized, cysteine was undetectable and is therefore 
excluded. 

FIG. 19 is four tabular examples of Edman sequence complexity among 
RP-HPLC ligand fractions from BBM.l-purified B*1501. While each of the 
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fractions marked by dots in FIG. 15 was subjected to Edman sequencing, the 
results obtained from fractions 10, 15, 28, and 31 are shown here to specifically 
illustrate some of the diverse trends observed among constituents of the ligand 
mixture. Dominant and strong assignments were made as described 
hereinbefore for FIG. 13; weak assignments were made for amino acids 
demonstrating a 1.5 to less than 2.0-fold picomolar increase over previous 
cycles. No assignments are shown for PI due to lower confidence since 
comparison with former cycles was impossible. The positional assignments in 
fraction 15, which are identified in bold italics, appear to correspond to a 
hexamer, IAVGYV, derived from HLA class I a-chain 23 _ 2 8 (Prilliman et al. 1998). 

FIG. 20 is five graphs summarizing teh length diversity among the 449 
characterized HLA-B15 ligands. Graphed data from the ligands listed in Tables 
A-E summarizes length diversity among the peptides respectively characterized 
from B*1501, B*1503, B*1508, B*1510, and B*1512. 

FIG. 21 is two graphs showing N- and C-regional diversity observed 
through alignments of B*1501 ligands. Frequency of occurrence for amino 
acids at the first (A) and final (B) four positions among the ligands 
characterized from B*1501 (Table A). The graphs were generated from 
separate N- and Oterminal data alignments. 

FIG. 22 is two graphs showing N- and C-regional diversity observed 
through alignments of B*1503 ligands. Frequency of occurrence for amino 
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acids at the first (A) and final (B) four positions among the ligands 
characterized from B*1503 (Table B). The graphs were generated from 
separate N- and C-terminal data alignments. 

FIG. 23 is two graphs showing N- and C-regional diversity observed 
through alignments of B*1508 ligands. Frequency of occurrence for amfno 
acids at the first (A) and final (B) four positions among the ligands 
characterized from B*1508 (Table C). The graphs were generated from 
separate N- and C-terminal data alignments. 

FIG. 24 is two graphs showing N- and C-regional diversity observed 
through alignments of B*1510 ligands. Frequency of occurrence for amino 
acids at the first (A) and final (B) four positions among the ligands 
characterized from B*1510 (Table D). The graphs were generated from 
separate N- and C-terminal data alignments. 

FIG. 25 is two graphs showing N- and C-regional diversity observed 
through alignments of B*1512 ligands. Frequency of occurrence for amino 
acids at the first (A) and final (B) four positions among the ligands 
characterized from B*1512 (Table E). The graphs were generated from 
separate N- and C-terminal data alignments. The information shown here for 
B*1512 ligands is skewed in relation to that obtained for ligands from the other 
four allotypes since (i) a comparatively fewer B*1512 peptides were sequenced, 
and (ii) over half of the B*1512 peptides characterized were specifically 
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investigated as ion map differences with B*1501. 

FIG. 26 is a graphical and tabular representation showing ligand overlaps 
identified by NanoES-MS mapping and characterized by NanoES-MS/MS during 
comparative analysis of 6*1508, 6*1501, B*1503, and B*1510 extracts. 
Ribbon diagrams of the class I antigen binding groove indicate residue 
substitutions (black, numbered from the first residue of the mature cc-chain) 
between each B15 molecule with respect to B*1501. The peptides are 
categorized into three different groups from top to bottom as follows: ligands 
that overlap B*1508 and B*1501; ligands that overlap B*1508, B*1501, and 
B*1503; and ligands that overlap B*1501 and B*1503. Dashes represent 
positions at which amino acids could not be unambiguously assigned through 
the NanoES-MS/MS fragmentation patterns and/or Edman data obtained. 
Ligand residues that coincide with dominant and strong amino acids for the 
given motifs (FIGS. 15 and 16) are indicated in bold. 

FIG. 27 is a pictorial representation of the proposed N-proximal and C- 
terminal anchoring of a nonamer overlapping B*1508 # B*1501, and B*1503. * 
The shared C-terminal anchoring preferences for Tyr in the NQZHGSAEY ligand 
among B*1508, B*1501, and B*1503 as defined by their respective motifs 
(FIGS. 15 and 16) are shaded black, while the varied N-proximal anchoring 
preferences likewise reflected in the motifs are shaded gray. Ligand residues 
are numbered sequentially from the N terminus. 
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FIG. 28 is a pictorial schematic summarization the summary of structure- 
function relationships among the HLA-B15 allotypes from which ligands were 
characterized. HLA-B15 allotypes for which a motif is known are indicated in 
bold, while allotypes with currently undefined motifs are indicated in italics. 
Arrows are representative of single mutagenic events (which are specifically 
defined for each arrow); bold arrows emphasize evolutionary pathways between 
structures for which motifs have been described either here or elsewhere. P2 
and P9 designations are listed. Tentative designations have been made for the 
italicized allotypes by extrapolation from known motifs In light of the 
polymorphisms present. P2/P9 specificities for B*1519 are based upon 
B*1512. P2/P9 specificities for B*1539 are based upon what has been 
observed between 6*4402 and B*4403 (Fleischhauer et al. 1994). P2/P9 
specificities for B*1529 are based upon B*1508. P2/P9 specificities for B*1523 
are based upon what has been observed between B*1502 and B*1513 (Table 
7), as well as observations between B*0801 and B*0802 (Arnett et al. 1998). 
P2/P9 specificities for B*1547 are based upon the relationship previously 
discussed between B*1503 and B*4801 (Martinez-Naves et al. 1997). P2/P9 
specificities for B*1537 are based upon B*1402 (DiBrino et al. 1994). The 
three sections (A, B, and C) into which the molecules have been divided are 
discussed in the text. For reference, a ribbon diagram of the class I antigen 
binding groove indicating residue substitutions (black, numbered from the first 
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residue of the mature a-chain) between the molecules with respect to B*1501 
is provided. 

FIG. 29 is a pictorial representation of the PCR strategy for the production 
of soluble HLA from gDNA according to the methods of the present invention. 

FIG. 30 is a gel image of the primary PCR of 3A394. 

FIG. 31 is a gel image of the secondary PCR of 3A394. 

FIG. 32 is a gel image showing 3A394 and pcDNA3.1 digested with EcoR 
I and Xba I. 

FIG. 33 is a gel image showing the restriction digests of 3A394 clones. 

FIG. 34 is a graph showing the comparative binding of three monoclonal 
antibodies to four soluble HLA molecules. 

FIG. 35 is a pictorial representation of the MHC 

FIG. 36 is a pictorial representation of an HLA molecule. 

FIG. 37 is a pictorial representation of the HLA binding groove that binds 
antigenic peptides. 

FIG. 38 is a pictorial representation of the antigen processing and 
assembly of the MHC class I/peptide complex. 

FIG. 39 is a pictorial representation of HLA peptide loading and movement 
of the HLA molecule to the cell surface. 

FIG. 40 is a pictorial representation of a tetramer biotinylated sHLA 
complex of the present invention. 
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FIG. 41 is a pictorial representation of a bacterial expression vector encoding a 
biotinylation substrate peptide sequence. 

FIG. 42 is a pictorial representation of the biotinylation of sHLA. 

FIG.43 is a pictorial representation of the conjugate used to confirm biotinylation 
of the sHLA molecule. 

FIG. 44 is a graph showing production of sHLA-B* 0702 by T 2 transfectants after 
peptide pulsing. 

f 

FIG. 45 is a graph showing the elution curve of the Elisa assay used to confirm 
sHLA production. 

FIG.46 is a graph showing that an increase in time results in greater 
biotinylation. 

FIG. 47 is a graph showing separation of biotinylated class I from free biotin. 
BRIEF DESCRIPTION OF THE TABLES 

Tables 1-24 and Tables A-E are set forth below on Pages 156-192. 

Table 1. Amino acid differences among the B*1512, 6*1508, B*1501, 8*1503, 
B*1518, and B*1510 allotypes. Residues, noted across the top of the table, are 
numbered from the N terminus of the mature class I heavy chain for each; the 24 
amino acid leader sequence is therefore not included. The residue positions are 
differentially highlighted in black (a t ) or gray (0C 2 ). Positions of identity with the 
consensus sequence (italicized) drawn from B*1501 are indicated by dashes. 

Table 2. PCR and sequencing primers for creating and verifying sHLA 
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constructs. Designated restriction enzyme cut sites are underlined on the 5' 
and 3' PCR primers, and the regions of the 3' PCR primers that inserted a stop 
codonn at position 300 are shown in bold italics. Sequencing primers were 
Cy5-labelled; the regions that they either sequenced through or hybridized with 
are indicated in parentheses. 

Table 3. Additional P2, P9, and length variability revealed through Edman 
sequencing of RP-HPLC ligand fractions from W6/32-purified B*1501, B*1508, 
5*1503, and B*1510. The number of fractions for each which were subjected 
to Edman degradation as described herein and shown in FIG. 13 are shown; 
ligands present in the fractions from all four molecules demonstrated evidence 
of (i) expanded P2 and P9 occupancy, and (ii) sequence beyond the 9 residues 
described by their respective motifs (as shown in FIGS. 15 and 16). 

Table 4. HLA-B15 ligands identically matching database source proteins, 
by category. The ligands are categorized according to source protein functions. 
Residues for the specific ligands are numbered from the initiating Met of their 
respective source proteins. Ligands from HLA-B15 molecules not studied here 
(B*1502, B*1509, and B*4601) are referenced herein. 

Table 5. Ligands unique to B*1512, as compared with B*1501. 
Positively-charged PI residues are indicated in bold. Dashes represent 
positions at which amino acids could not be unambiguously assigned through 
the NanoES-MS/MS fragmentation patterns and/or Edman data obtained. 
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Underlined residues designate tentative assignments. 

Table 6. Ligand binding groove residues interacting with the B-pocket 
among HU\-B15 allotypes bearing known Edman-derived motifs. Residues, 
noted across the top of the table, are numbered from the N terminus of the 
mature class I heavy chain for each. The residue positions are differentially 
highlighted in black (a x ) or gray (cx 2 ); vertical hatching immediately below 
further distinguishes those residues located on a helix. The molecules are 
grouped according to dominant/strong P2 motif similarities. Structural residues 
specifically discussed in the text are indicated in bold italics. 

Table 7. Ligand binding groove residues interacting with the F-pocket 
among HLA-B15 allotypes bearing known Edman-derived motifs. Residues, 
noted across the top of the table, are numbered from the N terminus of the 
mature class I heavy chain for each. The residue positions are differentially 
highlighted in black (a x ) or gray (a 2 ); vertical hatching immediately below 
further distinguishes those residues located on a helix. The molecules are 
grouped according to dominant/strong P9 motif similarities. Structural residues 
specifically discussed herein are indicated in bold italics. 

Table 8. B*1510 Hgands exhibiting Pro with Ala and/or Val in various C- 
proximal combinations. C-proximal occurrences of Pro, Ala, and Val are 
indicated in bold. Dashes represent positions at which amino acids could not 
be unambiguously assigned through the NanoES-MS/MS fragmentation patterns 
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and/or Edman data obtained. Underlined residues designate tentative 
assignments. 

Table 9. B*1501 ligands: N- and C-regional occupancies observed at 
>10% among 126 ligands. The positional occupancy percentages for residues 
among the ligands from Table A greater than 10% are specifically shown; 
dashes indicate occupany rates below 10% for the given side chains. N- 
regional positions and values are highlighted in black, while C-regional 
positions and C surp values are highlighted in gray. 

Table 10. B*1503 ligands: N- and C-regional occupancies observed at 
>10% among 74 ligands. The positional occupancy percentages for residues 
among the ligands from Table B greater than 10% are specifically shown; 
dashes indicate occupany rates below 10% for the given side chains. N- 
regional positions and N sum values are highlighted in black, while C-regional 
positions and C sum values are highlighted in gray. 

Table 11. B*1508 ligands: N- and C-regional occupancies observed at 
>10% among 96 ligands. The positional occupancy percentages for residues 
among, the ligands from Table C greater than 10% are specifically shown; 
dashes indicate occupany rates below 10% for the given side chains. N- 
regional positions and l\l sum values are highlighted in black, while C-regional 
positions and values are highlighted in gray. 

Table 12. B*1510 ligands: N- and C-regional occupancies observed at 
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>10% among 123 Iigands. The positional occupancy percentages for residues 
among the iigands from Table D greater than 10% are specifically shown; 
dashes indicate occupany rates below 10% for the given side chains. N- 
regional positions and N sum values are highlighted in black, while C-regional 
positions and C sum values are highlighted in gray. 

Table 13. B*1512 Iigands: N- and C-regional occupancies observed at 
>10% among 30 Iigands. The positional occupancy percentages for residues 
among the Iigands from Table E greater than 10% are specifically shown; 
dashes indicate occupany rates below 10% for the given side chains. N- 
regional positions and N sum values are highlighted in black, while C-regional 
positions and C sum values are highlighted in gray. 

Table 14. Overlaps identified through RP-HPLC/NanoES-MS ligand mass 
mapping and characterized by NanoES-MS/MS. Dashes represent positions at 
which amino acids could not be unambiguously assigned through the NanoES- 
MS/MS fragmentation patterns and/or Edman data obtained. Underlined 
residues designate tentative assignments. 

Table 15. Overlap frequencies observed between B*1501 and B*1512, 
B*1508, B*1503, or B*1510. The frequencies of ligand presentation overlap 
for the molecules listed with respect to B*1501 were determined from the 
number of total matching ion masses subjected to NanoES-MS/MS and the 
number of those ions collided analyzed by NanoES-MS/MS confirmed as actual 
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overlaps. The resulting values are conservative since not all ions selected for 
examination successfully yielded fragments that could be used for evaluating 
the sequence positivity of mass matches. 

Table 16. Potential B*1501 epitopes selected from the EBV gp85 
structural antigen. The EBV gp85 protein (accession number 1334905, Arrand 
et al. 1981) was manually scanned for epitopes with consideration to B*1501. 
In the first column, epitope candidates matching the length and sequence 
constraints of the B*1501 pooled motif (FIG. 15) are listed. In the second 
column, epitope candidates matching the B*1501 motif-prescribed P2 and C- 
terminal occupancies but demonstrating relaxed length constraints (7 to 11 
residues, according to the B*1501 panel in FIG. 20) are listed. In the third 
column, epitope candidates matching the B*1501 motif-prescribed nonameric 
length but demonstrating P2 flexibility (according to FIG. 21A) are listed. 

Table 17 PCR of sHLA from gDNA primers. 

Table 18 clone sequencing 5'CYS primers. 

Table 19 optical density readings and concentration of DNA extracted 
from sample 3A394. 

Table 20 optical density readings of positive clones. 

Table 21 optical density readings of AF/102 plasmid extracted. 

Table 22 viability of cells after two (2) days. 

Table 23 Elisa results for positive transfectants exhibiting G418 
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resistance. 

Table 24 HLA types and 
Table A derived peptide 
Table B derived peptide 
Table C derived peptide 
Table D derived peptide 
Table E derived peptide 
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sHLA molecules preduced. 
ligands from B*1501. 
ligands from B*1503. 
ligands from B*1508. 
ligands from B*1510. 
ligands from B*1512. 
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DETAILED DESCRIPTION OF THE INVENTION 

Before explaining at least one embodiment of the invention in detail by 
way of exemplary drawings, experimentation, results, and laboratory 
procedures, it is to be understood that the invention is not limited in its 
application to the details of construction and the arrangement of the 
components set forth in the following description or illustrated in the drawings, 
experimentation and/or results. The invention is capable of other embodiments 
or of being practiced or carried out in various ways. Also, it is to be understood 
that the phraseology and terminology employed herein is for the purpose of 
description and should not be regarded as limiting. 

As described hereinabove, characterizing naturally processed HLA class 
I and class II ligands is a key element behind the basic understanding of how 
polymorphism impacts ligand presentation. However, technical and scientific 
challenges including both extreme sample heterogeneity and limited sample 
sizes complicate such examinations. Thousands of distinct peptides are present 
within a ligand extract prepared from a single type of class I molecule, and the 
immunoprecipitation/extraction protocols typically employed to recover peptide 
ligands yield sparse quantities on the order of ~20 \jg (Hunt et al. 1992; 
Henderson etal. 1993). These factors often require specialized biochemical 
expertise not necessarily available in either the common laboratory or core 
facility. 
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Class I major histocompatibility complex (MHC) molecules, designated 
HLA class I in humans, bind and display peptide antigens upon the cell surface. 
The peptides they present are derived from either normal endogenous proteins 
("self") or foreign proteins ("nonself"), such as products of malignant 
transformation or intracellular pathogens such as viruses. In this manner, class 
I molecules convey information regarding the internal fitness of a cell to 
immune effector cells including but not limited to CD8 + cytotoxic T lymphocytes 
(CTLs), which are activated upon interaction with "nonself" peptides and which 
lyse or kill the cell presenting such w nonself" peptides. 

Class II MHC molecules, designated HLA class II in humans, also bind and 
display peptide antigens upon the cell surface. However, unlike class I MHC 
molecules which are expressed on virtually all nucleated cells, class II MHC 
molecules are normally confined to specialized cells, such as B lymphocytes, 
macrophages, dendritic cells, and other antigen presenting cells which take up 
foreign antigens from the extracellular fluid via an endocytic pathway. 
Therefore, the peptides they bind and present are derived from extracellular 
foreign antigens, such as products of bacteria that multiply outside of cells, 
wherein such products include protein toxins secreted by the bacteria that have 
deleterious and even lethal effects on the host. In this manner, class II 
molecules convey information regarding the fitness of the extracellular space 
in the vicinity of the cell displaying the class II molecule to immune effector 
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cells including but not limited to CD4 + helper T cells, which help eliminate such 
pathogens both by helping B cells make antibodies against microbes as well as 
toxins produced by such microbes and by activating macrophages to destroy 
ingested microbes. 

Class I and class II HLA molecules exhibit extensive polymorphism, which 
is generated by systematic recombinatorial and point mutation events; as such, 
hundreds of different HLA types exist throughout the world's population, 
resulting in a large immunological diversity among the population. Such 
extensive HLA diversity in the population results in tissue or organ transplant 
rejection between individuals as well as differing susceptibilities and/or 
resistances to infectious diseases. HLA molecules also contribute significantly 
to autoimmunity and cancer. Because HLA molecules mediate most, if not all, 
adaptive immune responses, HLA proteins are needed to study transplantation, 
autoimmunity, and for developing vaccines. 

There are several applications in which purified, individual class I and 
class II MHC proteins would be highly useful. Such applications include using 
MHC-peptide multimers as immunodiagnostic reagents for disease 
resistance/autoimmunity; assessing the binding of potentially therapeutic 
peptides; elution of peptides from MHC molecules to identify vaccine 
candidates; screening transplant patients for preformed MHC specific 
antibodies; and removal of anti-HLA antibodies from a patient. Since every 
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individual has different MHC molecules, the testing of numerous individual MHC 
molecules is a prerequisite for understanding differences in disease 
susceptibility between individuals. Therefore, purified MHC molecules 
representative of the hundreds of different HLA types existing throughout the 
world's population are highly desirable for unraveling disease susceptibilities 
and resistances and for designing therapeutics. 

Currently there is no readily available source of individual HLA molecules. 
Until now, the quantities of HLA protein available were small and typically 
consist of a mixture of different HLA molecules. Production of HLA molecules 
traditionally involves growth and lysis of cells expressing multiple HLA 
molecules. Ninety percent of the population is heterozygous at each of the HLA 
loci; codominant expression results in multiple HLA proteins expressed at each 
HLA locus. To purify native class I or class II molecules from mammalian cells 
requires time-consuming and cumbersome purification methods, and since each 
cell typically expresses multiple surface-bound HLA class I or class II molecules, 
HLA purification results in a mixture of many different HLA class I or class II 
molecules. When performing experiments using such a mixture of HLA 
molecules or performing experiments using a cell having multiple surface-bound 
HLA molecules, interpretation of results cannot directly distinguish between the 
different HLA molecules, and one cannot be certain that any particular HLA 
molecule is responsible for a given result. Therefore, a need exists in the art 
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for a method of producing substantial quantities of individual HLA class I or 
class II molecules so that they can be readily purified and isolated independent 
of other HLA class I or class II molecules. Such individual HLA molecules, when 
provided in sufficient quantity and purity, would provide a powerful tool for 
studying and measuring immune responses. 

The present invention envisions a method of producing MHC molecules 
which are secreted from mammalian cells in a bioreactor unit. Substantial 
quantities of individual MHC molecules are obtained by modifying class I or 
class II molecules so they are secreted. Secretion of soluble MHC molecules 
overcomes the disadvantages and defects of the prior art in relation to the 
quantity and purity of MHC molecules produced- Problems of quantity are 
overcome because the cells producing the MHC do not need to be detergent 
lysed or killed in order to obtain the MHC molecule. In this way the cells 
producing secreted MHC remain alive and therefore continue to produce MHC. 
Problems of purity are overcome because the only MHC molecule secreted from 
the cell is the one that has specifically been constructed to be secreted- Thus, 
transfection of vectors encoding such secreted MHC molecules into cells which 
may express endogenous, surface bound MHC provides a method of obtaining 
a highly concentrated form of the transfected MHC molecule as it is secreted 
from the cells. Greater purity can be assured by transfecting the secreted MHC 
molecule into MHC deficient cell lines. 
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Production of the MHC molecules in a hollow fiber bioreactor unit allows 
cells to be cultured at a density substantially greater than conventional liquid 
phase tissue culture permits. Dense culturing of cells secreting MHC molecules 
further amplifies the ability to continuously harvest the transfected MHC 
molecules. Dense bioreactor cultures of MHC secreting cell lines allow for high 
concentrations of individual MHC proteins to be obtained. Highly concentrated 
individual MHC proteins provide an advantage in that most downstream protein 
purification strategies perform better as the concentration of the protein to be. 
purified increases. Thus, the culturing of MHC secreting cells in bioreactors 
allows for a continuous production of individual MHC proteins in a concentrated 
form. 

Although class I and class II ligands were first examined by Edman 
sequencing, the primary characterization of individual ligands has been 
significantly improved upon through MS/MS applications. This is frequently 
performed via LC/MS using a microcapillary (<1 mm i.d) HPLC column directly 
interfaced with a triple quadrupole mass spectrometer (Hunt et al. 1992). The 
rationale behind using columns of small i.d. is that a lower solvent flow rate is 
permitted that ultimately increases the sensitivity of mass spectrometric 
detection (Tomer et al. 1994). 

Unfortunately, since sample load capacity decreases proportionately to 
the column diameter, such columns and LC/MS methods can be technically 
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difficult or inconsistent for laboratories not routinely employing them to 
operate, more robust protocols for producing and studying class I-derived 
peptides have been desired. The methodology of the present invention, as 
described herein, solves and/or meets this need in the art by a methodology 
which increases the quantities of ligands extractable by producing recombinant 
soluble class I and class II molecules. The sample amounts subsequently 
available offset handling losses at the bench, which have been estimated at 
50% (Veronese et al. 1996; Hogan et al. 1998; and Dr. Peter Parham, 
unpublished observations), and obviate the need for microcapillary LC/MS prior 
to MS/MS analysis. This is because significantly larger (20-fold) ligand samples 
are instead separated by standard offline RP-HPLC followed by NanoES-MS 
mapping and NanoES- MS/MS sequencing. The methodology of the present 
invention has proven consistent for comparatively examining peptides extracted 
from different class I molecules. As shown in FIG. 6, utilizing the production of 
soluble class I and class II molecules of the present invention, allows one of 
ordinary skill in the art to locate and characterize overlapping ligands among 
distinct allotypes (FIG. 6). 

The initial HLA molecules selected for examination and production were 
from the HLA-B15 family, a hypothetical schematic of which is presented in FIG. 
7. The HLA-B15 family represents a broad and diverse group of molecules 
comprised of nearly 50 evolutionarily related allotypes differing almost 
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sequentially by 1-15 peptide binding groove residues, and they are observed 
throughout numerous ethnic populations (Hildebrand et al. 1994); serological 
and DNA-based typing thus far confirm distribution of B15 alleles among 
Caucasians, Amerindians (North and South), Mexicans, Blacks (African and 
American), Indians, Iranians, Pakistanis, Chinese, Japanese, Koreans, and 
Thais. The majority of HLA B-locus polymorphisms known to exist are 
represented among the members of this allelic family. HLA-B*1501 appears to 
be the "ancestral allele/' 

The specific B15 allotypes initially selected for review and use with the 
present invention were B*1501 (Pohla et al. 1989; Choo et al. 1993; 
Hildebrand et al. 1994; Lin et al. 1996), B*1503 (Domena et al. 1993), B*1508 
(Hildebrand et al. 1994), B*1510 (Domena et al. 1993; Rodriguez et al. 1993; 
Rodriguez et al. 1996), B*1512 (Hildebrand et al. 1994), and B*1518 (formerly 
B*7901, Choo et al. 1991; Lin et al. 1996; Rodriguez et al. 1996) (Table 1 and 
FIG. 8). B*1508 differs from B*1501 by a single mutagenic event in the. a x 
helix, while B*1512 differs by a single mutagenic event in the a 2 helix; the 
remaining three alleles demonstrate a progressive series of polymorphisms 
throughout their binding grooves imposed by sequential mutagenic events 
during their divergent evolution from B*1501. 

Using the primer sets of HLA5UT (Domena et al. 1993) and SHLA3TM 
(Prilliman et al. 1997) or 5PXI and 3PEI (Table 2) and template DNA from 
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reliable full-length cDNA clones of HLA-B15 molecules 6*1501, 6*1503, 
6*1508, 6*1510 # 6*1512, and 6*1518, truncating PCR was performed for each 
on a Robocycler (Stratagene) for 30 cycles as previously described (Prilliman 
et al. 1997). The resultant PCR products contained the leader peptide, a u cc 2/ 
and a 3 coding domains of the HLA heavy chain. 

The PCR products were introduced into mammalian expression vectors. 
Initial constructs (truncated 6*1501, 6*1503, and 6*1508) were prepared with 
the PSROC-neo vector (Lin et al. 1990), which has formerly been used to express 
non-truncated HLA molecules (6arberetal. 1997; Martinez-Naves etal. 1997), 
while other constructs (truncated 6*1501, B*1503, B*1508, 6* 1510, 6*1512, 
and 6*1518) were additionally prepared with either pcDI\IA3 or pcDNA3.1(-) 
(Invitrogen). Constructs using the PSRa-neo vector were made from PCR 
products of the HLA5UT and SHLA3TM primers; the PCR products were 
subcloned into M13 (mpl8 or mpl9) according to standard protocols (Domena 
et al. 1993) so that confirmatory single-stranded DIM A sequencing could be 
performed with Cy5-labelled versions of the primers M13 universal, 4N, and 3N 
(mp 18) or M13 universal, 3S, and JD3S (mpl9), listed in Table 2 and described 
previously (Ennis et al. 1990; Domena et al. 1993), using the AutoLoad 
sequencing kit and ALFexpress automated sequencer (both Amersham 
Pharmacia 6iotech). The insert was then prepared and purified, and this was 
followed by subcloning into PSRa-neo. Constructs using the pcDNA3 vector 
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were made from PCR products of the 5PXI and 3PEI primers; these PCR 
products were subcloned into M13 and sequenced as above, following which the 
insert was subcloned into pcDNA3. Constructs using the pcDNA3.1(-) vector 
were made from products of the 5PXI and 3PEI primers; PCR products were 
directly subcloned into pcDNA3.1(-), following which double-stranded DNA 
sequence analysis was performed with Cy5-labelled versions of the primers 3S, 
4N, T7 promoter, and pcDNA3.1/BGH (Table 2). 

DNA from each of the construct clones was prepared using Qiagen Midi 
kits for transfection of the class I-negative B-LCL 721.221. Cells growing in log 
phase in RPMI-1640 + 2 mM L-glutamine + phenol red + 20% FCS were 
pelleted and electroporation was performed as described (Gumperz et al. 1995) 
prior to beginning selection with 1.5 mg/mL G418. Upon establishment of 
confluent growth after approximately 3 weeks, putative transfectant wells were 
screened for sHLA production using a sandwich ELISA (Prilliman et al. 1997). 
Transfectant wells positive for sHLA production were then subcloned by limiting 
dilution to establish cell lines optimally secreting greater than 1 pg/mL of class 
I molecules in static culture over 48 h. Satisfactorily subcloned transfectants 
were then expanded, frozen in RPMM640 + 20% FCS + 10% DMSO, and 
stored at -135°C. 

Since hollow-fiber bioreactors have been applied in place of in vivo 
hybridoma culture and MAb harvest from ascites (Evans et al. 1996) in order 



-54- 



WO 02/062846 PCT/USO 1/49744 

to continuously produce large quantities of pure immunoglobulins, they were 
utilized to produce and harvest the sHLA of the present invention. The Unisyn 
Technologies CP-3000, the standard flow path and primary components of 
which are diagrammatically simplified in FIG. 9, was selected for hollow-fiber 
bioreactor culture of successfully established transfectants. In this system, 
basal media is pumped into the fully-assembled system from a 200 L barrel; the 
media flows from the 4 L reservoir tank into the hollow-fiber networks of the 
four bioreactors, which provide 2.7 m 2 of surface area per cartridge, and then 
exits as waste. ECS media and sHLA harvest are tandemly pumped into and 
out of the 270 mL cartridges, respectively, with each bioreactor 
receiving/yielding equal media/harvest volumes as regulated by in-line 
solenoids. 

The CP-3000 was set up according to the manufacturer's protocol. After 
the system was completely prepared, at least 1 x 10 9 viable cells of a 
transfectant were grown in roller bottles of RPMI-1640 + 2 mM L-glutamine + 
phenol red + 10% FCS. The cells were pelleted and inoculated into the ECS of 
the bioreactor cartridges. ECS feed and harvest bottles were then attached to 
their corresponding lines, and the basal and recirculation rates were initially set 
to 100 and 1000 mL/h, respectively; the ECS was usually not activated until 24- 
36 h following inoculation. The system was then monitored at least twice daily 
over 4-6 weeks, with adjustments made as necessary. This involved checking 
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the glucose concentration and pH from manually-drawn reservoir samples, 
checking DO readings, and regulating the basal and ECS rates accordingly 
(Prilliman et al. 1997; Prilliman et al. 1998). A fresh harvest sample was 
periodically extracted directly from the system to quantitate sHLA production 
by ELISA for production level monitoring (FIG. 10). 

Cells were cultured in the bioreactor system during each run until the 
desired amount of sHLA had been produced and collected in harvests. Each of 
the bioreactor runs chronologically depicted in FIG. 10 was aborted once 
approximately 150 mg of sHLA was determined by ELISA to be contained in the 
respective harvests collected (Prilliman et al. 1997). The majority of runs were 
performed using 721.221 cells transfected with pcDNA3 or pcDNA3.1 (-) vector 
constructs, considering: (i) the significant differences in time frame between 
the runs performed using cell lines expressing soluble B*1501 from either the 
PSRa-neo or pcDNA3 vectors (3 months versus 1 month); and (ii) the fact that 
peptides extracted during each run produced identical motif results. 

Upon completing a bioreactor run, sHLA complexes were purified from the 
harvests obtained (Prilliman et al. 1997). A 100 mL matrix of either the p 2 m- 
specific MAb BBM.l (Brodsky et al. 1979) or W6/32 (Barnstable et al. 1978) 
coupled to CNBr-activated Sepharose 4B (Amersham Pharmacia Biotech) 
according to the manufacturer's instructions was equilibrated with wash buffer 
(20 mM sodium phosphate, pH 7.2 + 0.02% sodium azide), and harvests were 
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applied to the column using a GradiFrac LC system (Amersham Pharmacia 
Biotech); the load capacities for 100 mL matrices of the MAbs BBM.l and 
W6/32 were approximated at 10 and 40 mg sHLA respectively, as monitored 
for saturation ELISA of screening pre- and post-column samples. The column 
was then washed, eluted with 0.2 N acetic acid, and neutralized with wash 
buffer. Both BBM.l and W6/32 were used to affinity purify B*1501, the first 
molecule prepared. However, due to the differences in purification efficiency 
noted above between the two MAbs, W6/32 alone was employed to isolate the 
remaining molecules from bioreactor harvests. This MAb has been frequently 
used by others in purifying HLA (Falk et al. 1991; Barber et al. 1997). 

The fractions collected during affinity column elution demonstrating UV 
absorbance at 280 nm were pooled, and glacial acetic acid was added to 10% 
volume to extract the peptides as described (Barber et al. 1995). Bound 
peptides were separated from heavy chains, p 2 m, and BSA by passage through 
3 kDa exclusion membrane filters (Amicon) (Prilliman et al. 1997). The ligand- 
containing eluate was then lyophilized. 

To remove residual salts and free amino acids remaining from the 
extraction process, isolated peptides were purified of free amino acids and salts 
prior to fractionation. This was done on a 2.1 x 100 mm C18 column (Vydac) 
with a steep RP-HPLC gradient using a DYNAMAX HPLC system (Rainin). The 
gradient was generated by increasing to 100% buffer B (0.06% TFA in 100% 
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acetonitrile) in 1 min, holding for 10 min, and returning to buffer A (0.1% TFA 
in HPLC-grade water) in 1 min. The column was loaded with peptides 
reconstituted in the minimum volume of buffer A required for solubilization. 
During the run, the region corresponding to absorbance at 214 nm was 
manually collected. Typically l/100th of the total purified ligand collection 
volume was removed and subjected to Edman sequencing for 14 cycles on a 
492A pulsed liquid-phase protein sequencer (Perkin-Elmer Applied Biosystems 
Division) according to established protocols (Falk et al. 1991; Barber et al. 
1995). 

The purified ligands were next fractionated by RP-HPLC. For preliminary 
diversity assessment, approximately 150 pg of peptides, as calculated from the 
ELISA-based total mass of sHLA bound to the affinity column and an estimated 
50% handling loss, were loaded in 10% acetic acid onto a 1.0 x 150 mm C18 
column (Michrom Bioresources, Inc.) and separated using an initial gradient of 
2-10% buffer B (0.085% TFA in 95% acetonitrile) in 0.02 min followed by a 
linear gradient of 10-60% buffer B in 60 min at 40 pL/min on a 2.1 x 150 mm 
C18 column (Michrom Bioresources, Inc.); buffer A was 0.1% TFA in 2% 
acetonitrile. Absorbance was monitored at 214 nm, and fractions were 
automatically collected every minute. For comparative analyses, approximately 
400 pg of peptides were injected in 10% acetic acid containing 2 pg of the dye 
methyl violet base B to control for gradient consistency between runs (FIG. 11). 
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The gradient formation parameters consisted of 2-10% buffer Bin 0.02 min and 
10-60% buffer B in 60 min at 180 pL/min. Absorbance was monitored at 214 
nm, and fractions were automatically collected every minute. Edman 
sequencing of fractions, when performed, was conducted on l/20th of each. 

To map extracted peptides and obtain primary sequences, a triple 
quadrupole mass spectrometer with an ES ion source, as generically depicted 
in FIG. 12, was employed. By using a triple quadrupole instrument, not only 
are all of the ions present within a given fraction be summarized for a 
designated mass range (mass mapping), but ions may then be selectively 
fragmented in order to obtain information from which sequence information can 
be derived (characterization). This is due to the flexibility afforded by the 
quadrupole mass analyzers: Ql and Q3 act as mass filters which can be set to 
generate alternating DC and RF voltage fields for selectively transmitting 
specific ions (Watson 1997). However, q2 is an enclosed transmission-only 
quadrupole; it can be pressurized with inert gas for the collisional dissociation 
of an ion transferred through Ql. The specific ionization interface, NanoES, 
chosen here as an ES source functions on the principles described by 
developers Wilm and Mann (Wilm and Mann 1996). To establish and validate 
the procedure, comprehensive peptide mapping and sequencing were first 
performed among fractions 6 through 19 (FIG. 11), which represented a region 
of relatively rich ligand concentration (data not shown), for B*1501, B*1503, 
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B*1508, and B*1510; once this was accomplished, a more focused, and 
therefore less extensive, comparison was subsequently made between B*1501 
and B*1512. 

Prior to NanoES-MS, RP-HPLC fractions were completely dried by speed 
vac; the peptides were then resuspended in 0.1% acetic acid in 1:1 
methanol: water. Aliquots from each of the individually concentrated fractions 
were loaded into 5 cm. gold/palladium alloy-coated borosilicate pulled glass 
NanoES sample capillaries (Protana A/S). To begin sample flow and data 
collection, the loaded capillary tube was next carefully opened as described 
(Wilm and Mann 1996). The capillary was then positioned directly in front of 
the API III + (PE SCIEX) triple quadrupole mass spectrometer's orifice, and 20- 
30 scans were collected as separate data files for the mass range 325-1400 
m/z while operating the instrument at positive polarity. This procedure was 
performed sequentially to obtain constituent mass data for samples drawn from 
each RP-HPLC fraction. 

Spectral "ion maps" were generated from the TICs acquired for each 
fraction. The maps obtained from corresponding fractions of peptides eluted 
from different HLA-B15 molecules were aligned (FIG. 13A), and ions of interest 
for NanoES-MS/MS were located. The ion maps were typically compared 
following baseline subtraction and smoothing. Putative ligand matches or, in 
the case of B*1512, mismatches among the ions were identified through a 



-60- 



WO 02/062846 PCT/USO 1/49744 

combination of data centroiding and direct visual assessment. Preference was 
placed upon selecting doubly-charged, or [M+2H] 2+ , or higher ion forms 
commonly resulting from electrospray ionization for subsequent NanoES-MS/MS 
since the resulting daughter ion spectra were richer than those obtained from 
the collision of singly-charged, or [M+H] + , ions (data not shown). 

NanoES-MS/MS was performed by loading into a NanoES capillary tip, as 
described above, the desired volume of a fraction for which data was to be 
acquired. The volume loaded depended upon the relative sample flow rate 
achieved after opening the capillary tip and how long data acquisition was 
intended to proceed. Typically 3-4 pL were loaded at a time to collect MS/MS 
data for 20-25 mid- to low-intensity ions from a given fraction. Once loaded, 
the source head was positioned and the capillary opened as before. Separate 
data files were collected for each ion subjected to collisional dissociation. 

Daughter ion spectra were generated from the TICs obtained in this 
manner for each ion chosen. The specific approach taken to interpet individual 
MS/MS spectra varied from ion to ion depending upon the quality of the data 
sets obtained but adhered to the general rules of MS/MS fragment 
interpretation (Roepstorff and Fohlman 1984). The Predict Sequence algorithm 
included as part of the BioMultiVrew software (BioToolBox package, PE SCIEX) 
was employed for the de novo sequence interpretation, and the sequences 
deduced were checked for identity with source proteins in various databases 
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using the PeptideSearch algorithm (European Molecular Biology Laboratory; 
Mann and Wilm 1994) and performing advanced BLAST searches (Altschul et 
al. 1997) against the National Center for Biotechnology Information (National 
Institutes of Health) databases. 

Leu and He were indistinguishable unless suggested by Edman data 
and/or specific sequence matches, as were Gin and Lys since lysyl derivatfzation 
prior to fragmentation was not performed. NanoES-MS/MS data from ions of 
potentially overlapping peptides was aligned to confirm or refute the presence 
of shared ligands among different HLA-B15 molecules, as shown for one ion 
confirmed as an overlapping peptide across B*1501, B*1503, and B*1508 in 
FIG. 13, (B and C). Reproducibility of the protocol in its entirety is 
demonstrated in FIG. 14. To establish a numerical description (N^^G^) 
comparing ligand N- and C-regional occupancies for each allotype, N and C 
values for the four ligand positions at either terminus were determined by 
summing occurrence frequencies (using an arbitrarily-defined baseline of 10%); 
N sum was subsequently calculated from the four N values, and C sum was 
calculated from the four C values. 

Peptides from HLA-B15 molecules were subjected to pooled Edman 
sequencing as well as more extensive examinations, including fractional Edman 
sequencing and mass spectrometric characterization of individual ligands. This 
was done to: (i) confirm the production/purification methods employed; and 
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(ii) evaluate the relative nature and complexity of the peptides contained in 
extracts of naturally presented ligands. 

Upon extracting peptides from each of six different B15 molecules, pooled 
Edman sequencing was performed. This was done both to validate results from 
the extraction of sHLA ligands with the techniques previously employed by 
others (B*1501 and 6*1508, Falk et al. 1995; Barber et al. 1997), and to 
obtain novel motifs from the molecules that had not been previously examined 
(B*1503, B*1510, B*1512, and B*1518) for providing "traditional" points of 
reference. 

Overall, the B*1501 motif (FIG. 15) was in agreement with the dominant 
P2 and P9 anchors (Gin and Tyr/Phe, respectively) previously defined (Falk et 
al. 1995; Barber etal. 1996; Barber etal. 1997). This result demonstrates that 
the peptides extracted from sHLA-B*1501 complexes were identical to those 
extracted by others from natural membrane-bound molecules. However, 
differences in the whole pool sequencing data arising from sHLA purified by 
BBM.l (FIG. 15B) versus W6/32 (FIG. 15A) indicated that a greater number of 
peptides than originally realized should actually contribute to the consensus 
B*1501 motif. 

First, while Gin and other residues including Leu, Met, and Val have been 
previously reported at P2 using W6/32 to isolate complexes, the aliphatic side 
chain Pro was strongly detected at P2 in the BBM.l-purified B*1501 motif as 
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well. Though not employed by other groups pursuing similar studies, the p 2 *ri- 
specific MAb BBM . 1 was initially chosen here to avoid biases potentially imposed 
upon the class I heavy chain by bound peptides (Bluestone et al. 1992; 
Catipovic et al. 1992; Solheim et al. 1993). Of interest was that Pro has not 
been reported before as a strong or even weak P2 anchor in the B*1501 
peptide motif. It has been suggested that B-locus allotypes that present 
peptides with Pro at P2 demonstrate a shallower B-pocket within their binding 
grooves than does B*1501, which exhibits a Ser at a-chain position 67 rather 
than a more constricting residue such as Phe (Barber et al. 1997). 

This data suggests the possibility that Pro binds amicably within this 
deeper pocket but perhaps induces an altered heavy chain conformation that 
negatively biases purification of complexes by the W6/32 MAb typically used. 
The observation of a P2 Pro occupying a pocket with suboptimal physical 
complementarity is corroborated by a similar occurrence among peptides bound 
by the murine class I molecule L d (Corr et al. 1992; Balendiran et al. 1997). 
Purification methodology serves, therefore, as a factor in allele-specific motif 
predictions, and the whole pool sequencing with peptides extracted from both 
BBM.l and W6/32-purified B*1501 complexes demonstrated that a strong Pro 
anchor at P2 is antibody dependent. 

The pooled Edman motifs obtained for W6/32-purified molecules 
divergent from B*1501 are shown in FIG. 16. Like B*1501, each of the motifs 
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reflected a nonameric consensus with distinct P2 and P9 anchors and internal 
auxiliary anchor preferences. The B*1508 motif, described by another group 
while its preparation was in progress during the development of this invention 
(Barber et al. 1997), was consistent between the two laboratories; it 
demonstrated a preference for the small side chains Pro and Ala at P2 and 
aromatic residues Tyr and Phe at P9. The B*1512 motif appeared nearly 
identical to that obtained from B*1501; by extension, considering that B*1519 
differs from B*1512 in a 3 , which does not contribute to the peptide binding 
groove, it is predicted that B*1519 would bear the same motif as B*1501 and 
B*1512. 

B*1503 diverges somewhat from the other three molecules presented 
above in showing a distinct preference for ligands with a neutral, polar Gin or 
positively-charged Lys as the P2 anchor; the aliphatic Met was evident here as 
well, though to a lesser degree than noted for the hydrophilic Gin and Lys 
residues (Prilliman et al. 1999). Like B*1501, B*1508, and B*1512 however, 
aromatic residues Tyr and Phe defined a hydrophobic P9 anchor. The only other 
class I molecules with motifs whose definitions thus far indicate a Lys at P2 are 
B*3902 (Falk et al. 1995) and B*4801 (Martinez-Naves et al. .1997), both of 
which structurally bear B-pockets identical to B*1503 except for a single L-T 
or L->E substitution, respectively, attheoc 2 helical residue 163 (Chelvanayagam 
1996). The B-pocket of B*1503 is indistinguishable from that of B*4802 
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(Cheivanayagam 1996), whose motif remains undetermined but is likely to 
follow suit with those of B*1503 and these other molecules at the second ligand 
position. An assortment of polar, charged, and hydrophobic residues is evident 
at P3 of the B*1503 motif. 

The B*1510 motif demonstrated a strict preference for ligands bearing a 
basic, hydrophilic His as a P2 anchor. A hydrophobic P9 anchor was described 
by residues including Leu and Phe. The B*1510 motif strongly resembled that 
previously defined for B*1509, which exhibits nearly identical anchor 
preferences with His at P2 and Leu, Phe, and Met at P9 (Barber et al. 1997). 
B*1510 and B*1509 differ structurally only by a substitution of N-D in 0C 2 at 
the cc-sheet floor position 114, which takes part in forming several specificity 
pockets within the peptide binding groove (FIG. 3). 

By extrapolation from its structural neighbors, it was assumed that 
B*1518 would have for its motif a P2 anchor of His (as seen for B*1510 and 
B*1509) and a P9 of Tyr and Phe (as seen with 6*1501, B*1503, B*1508, and 
B*1512). B*1518 differs from B*1510 solely at position 116; two other HLA-B 
molecules that differ exclusively at this position are B*3501 and B*3503: they 
differ by a S-F substitution here, which would sterically mimic the substitution 
between B*1518 and B*1510 and confer B*1510-like P9 preferences (Steinle 
et al. 1995; Kubo et al. 1998). Based upon this, and the fact that the P9 
environments of B*1518/B*3501 and 6*1510/6*3503 are similar 
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(Chelvanayagam 1996), it was first predicted, and then confirmed following 
pooled sequencing, that B*1518 would bear the "hybrid motif" described. 

Pooled Edman sequencing data therefore demonstrates that (i) the 
peptides extracted from sHLA complexes produced according to the 
methodology of the present invention are consistent with those previously 
extracted from native, cell surface-expressed complexes, and (ii) nonameric 
ligand lengths with anchor residues at P2 and P9 characteristic to specific 
polymorphisms are preferred. As for functional implications, the major anchors 
would predict natural ligand overlaps with B*1501 by B*1503 and B*1512. 

Since class I peptide pools consist of thousands of different ligands, an 
early investigation during development of the analysis strategy and methods 
disclosed herein was to next fractionate and then Edman sequence the peptides 
extracted from one of the molecules. BBM.l-purified B*1501 (Prilliman, et al. 
1997), the first soluble molecule produced by a non-repeatab!e precursor 
methodology to the fully repeatable and characterized methodology of the 
present invention, was initially examined to explore the general diversity 
around a pooled motif (Prilliman et al. 1998). 

Edman sequencing of peptide-containing fractions collected from the RP- 
HPLC gradient shown and described in FIG. 17 supported the existence of 
peptides up to 12 residues long and revealed significant positional diversity 
among the peptides isolated from B*1501. This positional diversity was 
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illustrated in 16 representative fractions (FIG. 18). Assessment of dominant, 
strong, or weak amino acid residues present at the various positions indicated 
that the dominant anchors previously defined by whole pool sequencing did not 
necessarily predominate in fractions of the peptide pool (FIG. 19). In fractions 
15 and 31 the dominant P2 Gin was replaced by a dominant Ala and a dominant 
Lys respectively, while the pooled motif Tyr fell below residues such as His and 
Lys at P9 In the same fractions. While variations on the consensus motif were 
prevalent at P2, this was not solely restricted to the N termini of bound peptides 
as shown in FIG. 18. 

Among the 16 fractions studied, 12 demonstrated weak sequence yields 
out to 12 cycles of degradation; in nine of these fractions, P12 was occupied by 
the charged or polar residues Glu, Arg, Lys, Ser, His, or Gin. Additionally, the 
presence of numerous decamers within the B*1501 peptide population was 
suggested in that 11 of the fractions sequenced exhibited a typical P9 residue, 
Tyr or Phe, at P10. For example, in FIG. 19 the strong Phe presence at P10 in 
fraction 28 suggests that P10 serves as an anchor for a decamer(s) present 
within the fraction. The shifting of a P9 anchor preference to P10 is consistent 
with the P10 occupancies of individual decamers formerly characterized from 
B*1501 peptide pools (Falketal. 1995; Barber etaL 1997). Although residues 
representing P9 anchors were seen at P10 and P12, amino acids unique in such 
longer ligands were also detected, suggesting that the shifting of a previously- 
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reported P9 anchor is not the only means by which longer B*1501 peptides are 
bound within the peptide binding groove (Collins et al. 1994). 

During the course of examining ligandsfrom other sHLA molecules, which 
were purified alternatively with MAb W6/32, aliquots of RP-HPLC fractions from 
some were also subjected to Edman sequencing on occasion. The results from 
these random samplings of 6*1501, 6*1508, 8*1503, and 6*1510 fractions are 
summarized for major characteristics in Table 3. The characteristics recorded 
included preferences other than those seen in the pooled motifs at traditional 
anchor positions P2 and P9 and whether signal levels indicated the presence or 
not of residues beyond the nine cycles of degradation typically observed. Of 
additional interest, no Lys was observed among Edman sequenced fractions as 
an additional preference at P2 for 6*1501 purified with W6/32, a finding 
contrary to the P2 preferences among fractions of 66M.l-purified material 
which lends further support to the argument for MAb bias, as discussed earlier, 
existing in complex purification. 

In summary, the fractionation of 6*1501 peptides prior to Edman analysis 
resulted in amino acid sequence data demonstrating that the components of a 
peptide pool can vary considerably from the overall motif (Prilliman et al. 
1998). These data, as well as results obtained from other 615 molecules, 
suggested that peptides bound by the various molecules include: (t) species 
which are either longer or shorter than the nonameric size typically indicated 
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by pool sequencing alone; and (H) species that exhibit primary sequences 
different from those predicted by pooled sequencing. The additional data 
provided by further exploration of fractionated extracts from several of the B15 
molecules in this manner expands the molecules with the potential for 
presenting B*1501-overlapping ligands to include not only B*1503 and B*1512 
but B*1508 and B*1510 as well. Such diversity among fractions indicated that 
characterization of individual ligands would provide information not available in 
motifs. 

Individual peptides from B*1501, B*1503, B*1508, 6*1510, and B*1512 
were comparatively examined to investigate whether the added flexibility 
observed through Edman degradation of RP-HPLC fractions would allow for 
natural ligand overlaps to occur across their respective polymorphisms. 

More than 400 individual ligands extracted from the five distinct HLA-B15 
allotypes were characterized according to the methodology of the present 
invention. The ligands characterized here were from ion map masses found in 
multiple B15 allotypes as demonstrated in FIG. 13A. Selected ions were then 
dissociated by NanoES-MS/MS, and the resulting fragment information was 
compared and interpreted, as described hereinabove, to determine if the ions 
represented sequence-identical or merely mass-identical ligand matches. 

Individual peptide ligands characterized from the five B15 allotypes are 
listed in Tables A-E. The number of ligands for which either complete or partial 
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sequences were obtained here was as follows: B*1501 = 126, B*1503 = 74, 
B*1508 = 96, B*1510 = 123, and B*1512 = 30. While the pooled motifs of 
peptides extracted respectively from the five molecules described nonamers 
with various P2 and P9 dominant anchors and P3 auxiliary anchor preferences 
(FIGS. 15 and 16), the single peptide sequences ranged from 7 to 12 amino 
acids in length and demonstrated (i) greater heterogeneity at their N-terminal/ 
proximal regions than their C termini, and (ii) varying degrees of observed 
ligand overlap, both of which will be examined in the subsequent sections of 
this chapter. 

In terms of length heterogeneity, the endogenous peptides eluted from 
B*1501, B*1503, B*1508, B*1510, and B*1512 varied in length from 7 to 12 
amino acids as shown (FIG. 20). An overall length breakdown of the peptides 
listed in Tables A-E demonstrates that approximately 6% are heptamers, 21% 
are octamers, 50% are nonamers, 19% are decamers, 3% are undecamers, 
and 1% are dodecamers. Further emerging from the length characterization 
of individual ligands is the observation that peptides bound by each of the B15 
molecules spanned ranges of 5 to 6 amino acids in length. For example, 
peptides eluted from B*1501, B*1510, and B*1512 were 7-11 amino acids in 
length, while those from B*1503 and B*1508 were 7-12 amino acids in length. 

Coupling this length variability with the likewise varying degrees of 
regional sequence heterogeneity (which will be discussed) leaves only 23% of 
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the endogenously loaded peptides characterized in Tables A-E as "ideal 
nonamers" with both P2 and P9 anchors in concordance with the dominant or 
strong preferences of the pooled Edman motifs from their respective source 
molecules. This finding is of principal significance in that a majority (77%) of 
potential ligands for any of these HLA-B15 molecules would therefore be 
overlooked if the length and sequence constraints of their pooled motifs were 
utilized as the primary criteria in searching for potential epitopes specific to 
them. 

Examples of ligands from this study with homology to stretches of known 
proteins are shown in Table 4. The peptides yielding 100% identical BLAST 
database hits were grouped into seven categories, which were defined here 
according to the common natures of their potential source proteins: HLA 
ligands, replication/transcription/translation ligands, biosynthetic/degradative 
modification ligands, signalling/modulatory ligands, transporter/chaperone 
ligands, structural/cytokinesis ligands, and unknown function ligands. Aside 
from the HLA heavy chain-derived ligands, most appear to be derived from 
cytoplasmic or nuclear proteins, which illustrates that the typical endogenous 
pathway is involved in generating the majority of the class I-loaded peptides 
characterized (York and Rock 1996). 

Of the 44 peptide sequences listed in Table 4, it is noteworthy that 
overlaps across other HLA-B15 molecules are evident within our data collection. 
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The B*1510 tapasin 354 . 362 ligand HHSDGSVSL, as well as both THTQPGVQL from 
septin 2 homolog 70 _ 78 and S HANS AWL from p-adaptin 249 _ 257 , have also been 
sequenced from B*1509 extracts (Barber et al. 1997), and the 
B*1501/B*1508/B*1512 ubiquitin-protein ligase 83 _ 91 -derived ligand ILGPPGSVY 
was characterized from endogenously bound B*1502 peptides (Barber et al. 
1997). The eIF3-p66 61 _ 69 nonamer SQFGGGSQY (Falk et al. 1995; Barber et al. 
1996) was found here within B*1501, B*1503, B*1508, and B*1512 extracts. 
The decamer YMIDPSGVSY, which is homologous to proteasome subunit C8 150 . 
159 , was also previously described as a ligand for B*1502 (Barber et al. 1997), 
B*1508 (Barber et al. 1997), and B*4601 (Barber et al. 1996); It was found 
here presented by B*1501, B*1508, and B*1512. Some of the specific 
overlapping ligands identified in this study therefore overlap in antigen 
presentation with the HLA-B15 allotypes characterized by others. 

Given the length heterogeneity observed among the ligands collectively 
characterized from B*1501, B*1503, B*1508, B*1510, and B*1512, analysis 
of peptide ligand primary structures proceeded through two separate 
alignments (N- and C-terminal) for each HLA-B15 allotype. The frequencies 
with which specific side chains occurred at (i) the N terminus and first three 
residues internal from it, and (ii) the C terminus and the first three residues 
internal to it were tabulated. As graphically summarized throughout parts A 
and B, respectively, of FIGS. 21-25, the salient features of these ligand regions 



-73- 



WO 02/062846 PCTAJS01/49744 

and how they correlated with the structures of the molecules from which they 
were extracted can be examined. 

The N-terminal/proximal regions for ligands from each of B*1501 (FIG. 
21A), B*1503 {FIG. 22A), B*1508 (FIG. 23A), B*1510 (FIG. 24A), and B*1512 
(FIG. 25A) clearly demonstrated acceptance of a variety of amino acid side 
chains, particularly at P3 and P4, by the portions of the binding groove assumed 
to interact with ligands at the designated positions. With the exception of 
B*1512 ligands, which were obtained from both a smaller and more biased 
collection of ions (see FIG. 25 legend), higher points in each of the graphs occur 
for certain side chains indicated along the PI and, to a greater extent, P2 data 
lines, which represent the first and second positions, respectively, of the 
characterized ligands. 

The results for PI obtained from the HLA-B15 ligands characterized are 
of interest since the analysis of Edman sequencing data depends upon 
comparing relative increases between cycles and is therefore unreliable for 
making side chain determinations at this first position, especially when complex 
mixtures of peptides are examined (Stevanovic and Jung 1993). All five B15 
allotypes demonstrated a number of side chains at PI, with preferences for 
residues including Ala, Leu/Ile, Gly, Ser, Thr, and Tyr observed in varying 
degrees among them (FIG.s 21-25, A). Overall, PI appeared to be occupied in 
a majority of ligands by aliphatic amino acids. 
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Though the pooled motifs of B*1501 and B*1512 (FIGS. 15 and 16) as 
well as the spectral ion maps obtained from their RP-HPLC fractions (data not 
shown) were virtually identical, the substitution difference between the two 
molecules at A-pocket residue 167 suggested that ligands bound by these 
molecules might differ at PI. Performing NanoES-MS/MS upon a handful of 
ions, which appeared to be exclusive to B*1512, confirmed the presence of 
several ligands presented by B*1512 but not B*1501 (Table 5). Of the 16 
sequenced, seven indicated His, two indicated Arg, and one indicated Lys at PI. 
In comparison, only a single B*1501 peptide each presented with His, Arg, or 
Lys at PI out of 126 ligands characterized (Table A). An explanation for the 
existence of this subset of B*1512-restricted ligands with positively-charged N- 
termini could lie in the W-G substitution observed between B*1501 and 
B*1512 at a 2 position 167, which might sterically enhance the influence by the 
adjacent acidic residue at position 166 (Glu in B*1501 and Asp in B*1512) of 
B*1512 upon PI in the A-pocket. Comparing individual ligands between 
B*1512 and B*1501 supports the notion that the polymorphism segregating 
them will confer distinct yet subtle effects upon peptide binding by other 
allotypes differing in this manner. 

P2, which has been considered to act as a primary anchor for peptide 
ligands among most class I molecules described to date as based upon pooled 
Edman motifs, is classically accepted to associate with the B-pocket of the 
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peptide binding groove. In terms of the motifs derived here by pooled 
sequencing (FIGS, 15 and 16) and the motifs previously established for other 
B15 family allotypes (Faiketai. 1995; Barber etal. 1996; Barber etal. 1997), 
a Gin at 92 is common to B*1501, B*1502, B*1503, B*1512, and B*1513 
(Table 6). Three alleles, 6*1502, B*1513, and B*1508, have a Pro at P2, while 
the lack of a strong Pro at P2 in both B*1501 and B*1503 corresponds to 
polymorphism at heavy chain positions 63 and 67. For example, B*1501 
appears to lose the propensity for Pro at P2 due to polymorphism at position 
63, while B*1508 appears to lose a Gin at P2 resulting from polymorphism at 
67. Thus, comparisons within the B15 family highlight how substitutions at 
positions 63 and 67 of the class I heavy chain cn 1 helix appear to confer 
differential interaction with P2 of the peptide ligand. 

While residue 63 modulates the size/conformation of P2, it can be seen 
that residues 24 and 45 influence the P2 charge nature propensities. A 
comparison of B*1501 and B*1503 illustrates how polymorphisms at positions 
24 and 45 of the B-pocket influence P2 preferences in this manner; B*1503 is 
one of four B15 alleles with known motifs bearing a positively charged P2. 
Allotypes B*1509, B*1510, and B*1518 recognize a positively charged His at 
P2 and have the same residues at 24 and 45 as B*1503, but the differences at 
positions 63 and 67, which separate B*1503 from the other three molecules, 
again modulate the contour of P2 such that different positively charged P2 
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residues fit respectively into the B*1503 and B*1509/B*1510/B*1518 B-pocket 
categories. 

It has previously been proposed that polymorphisms in the cx x helix 
prompt major changes in the repertoires of peptides bound by allotypes 
differing in this region (Barber et al. 1997). That any region, helical or sheet, 
of a x would influence peptide P2 preferences more than a 2 is of little surprise 
though since 14 of the 18 residues forming the B-pocket belong to the cn 1 
domain (Table 6). A comparison of 12 known B15 motifs in the B-pocket 
suggests more refined rules for a 2 in general, whereby polymorphisms in the 
helices sculpt the conformation and size of the amino acids that can fit into the 
peptide binding groove's B-pocket. Further analysis of the B15 motifs at P2 
suggests that polymorphisms lining the floor of the groove tend to regulate the 
hydrophobic and/or charged nature of the residues at P2 of bound ligands. 
Perhaps in this way the walls and floor of the binding groove work in concert: 
the a-helical residues sterically control which amino acids can fit, while the 13- 
sheet residues act to attract or repel particular side chains based on chemical 
compatibility within the ligand binding groove. 

The interactions described for P2 here are, however, more relaxed than 
previously thought. For a majority of the allotypes shown in Table 6, three or 
more different side chains are observed by pooled Edman sequencing as 
dominant or strong B-pocket residents and/or the B-pockets of the molecules 
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demonstrate abilities to naturally accommodate alternative P2 residues (Table 
3 and FIGS. 21-25, A). Of specific interest, although Met appears in the 
B*1503 pooled motif as a strong P2 anchor residue (FIG. 16), only two of the 
peptides characterized for this allotype bear Met at P2 (FIG. 22A), while other 
residues including Gly, Pro, Ala, and Asn occur more often than Met at P2 
among B*1503 ligands. The fact that Met appears in the pooled motif but fails 
to demonstrate a strong presence among the individual peptides indicates that 
disparate concentrations of ligands within extracts may skew pooled Edman 
sequencing results so as to be misleading. 

Pro and Ala likewise appear with frequencies comparable to or exceeding 
those of the W6/32-purified pooled motif residues for B*1501 (FIG. 21A), and 
B*1508 ligands illustrate P2 inclinations for a rich array of side chains in 
addition to the motif-prescribed residues Pro and Ala which include Gly, Val, 
Met, Leu/Ile, Ser, Thr, and Gln/Lys. Similar variety is observed within the 
limited B*1512 ligand data set (FIG. 25A). In contrast, the B-pocket 
composition for B*1510 indicates His as the sole dominant/strong P2 occupant 
(Table 6), and among individual ligands characterized from B*1510 His Is noted 
at a markedly higher degree than are alternative residues (FIG. 24A). 
However, amino acids including not only the positively charged Arg but to a 
greater extent Gly, Ala, Val, Leu/Ile, and Gln/Lys occur at P2 of some peptides 
are also characterized at P2 from this allotype. Thus the majority of HLA-B15 
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molecules demonstrate elastic N-proximal occupancies. 

In comparison with the findings at the N-terminal/proximal regions, the 
C termini of ligands from each of B*1501 (FIG. 21B), B*1503 (FIG. 22B) f 
B*1508 (FIG. 23B), B*1510 (FIG. 24B), and B*1512 (FIG. 25B) demonstrated 
a stricter acceptance of amino acid side chains. C-proximal ligand residues also 
revealed the existence of more distinct side chain tendencies. 

For allotypes B*1501, 6*1503, B*1508, and B*1512 a dominant C 
terminus was especially prominent among the ligands characterized from them, 
while B*1510 exhibited a P2 anchor nearly as strong as its primary Oterrninal 
residue preference (FIG. 24, A and B). The aromatic residues Phe and, even 
more prominently, Tyr occupied the C-terminal positions of most peptides 
bound by the first four B15 molecules, which appeared to agree with P9 of their 
respective motifs (FIGS. 15 and 16). The bulk of B*1510 ligands demonstrated 
Leu/Heat their C termini; other occupants at this position included Phe and Val, 
an interesting observation in that more B*1510 peptides presented with Val, 
which is not included in either the pooled (FIG. 16) or fractional (Table 3) 
Edman motifs examined from this allotype, than Phe, which is identified as a 
strong P9 occupant by pooled sequencing. Such is the likely result of disparate 
peptide concentrations affecting the pooled Edman sequencing results as 
mentioned previously. Another factor includes the diminishing picomole yields 
per successive cycle of Edman degradation; this leads to progressively higher 
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background signals and thus negatively affects sensitivity in examining the C- 
terminal/proximal regions of peptides (Stevanovic and Jung 1993). 

The overwhelming conservation at the C termini of individual ligands 
(FIGS. 21-25, B) indicates that the C terminus acts as a dominant anchor for 
peptide ligands. P9 of pooled Edman motifs has classically been accepted to 
associate with the F-pocket of the peptide binding groove. C-terminal 
anchoring is observed here regardless of length heterogeneity (FIG. 20 and 
Tables A-E). For the majority (91%) of peptides greater than 9 residues long, 
this observation agrees with evidence that longer peptide ligands bulge 
centrally outward from the peptide binding groove. Sequencing individual 
ligands supports a concept that the C terminus of a ligand plays a dominant role 
as an anchor within the class I binding groove for the HLA-B15 allotypes 
examined. 

With regard to pooled sequence motifs for HLA-B15 allotypes, all 12 
molecules demonstrate chemical homogeneity at P9, with dominant/strong 
occupancies by hydrophobic residues (Table 7). The eight different F-pockets 
structurally represented among these allotypes show preferences forTyr, Phe, 
Met, Leu, and Trp according to three functional group categories. This is in 
marked contrast with the B-pockets, for which eight different B-pockets among 
the same allotypes comprised seven distinct functional groups encompassing 
a mixture of both hydrophobic and hydrophilic side chains (Table 6). 
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It is interesting that, of these HLA-B15 molecules, nine have F-pocket 
functionality in the same category (6*1501, B*1502, B*1503, B*1508, 
B*1512, 6*1516, B*1517, B*1518, and B*4601), with preferences for Tyr, 
Phe, and/or Met, despite the fact that they exhibit amino acid substitutions at 
nine different positions throughout the cc x helix and P 2 sheets. This redundancy 
demonstrates that, contrary to what was seen among structural residues 
affecting the B-pocket, the a x helical poiymorphism(s) shown for allotypes in 
the first category of Table 7 do not necessarily play a defined role in sculpting 
either the conformation or size preferences of ligands in this region of the 
peptide binding groove. 

While the Edman-derived motifs for the B15 allotypes shown in FIGS. 15 
and 16 clearly indicated P2 and P9 primary anchors and suggest an assortment 
of preferences at both P3 and P4, they fail to sufficiently capture the trends for 
auxiliary anchoring at the C-proximal regions of endogenously bound ligands 
which were perceptible throughout the individual peptide sequences. Additional 
preferences that likely serve as auxiliary anchors were evident at the C- 
proximal positions C 1 , C 2 , and C 3 in the cases of nonamers, octamers, and 
heptamers, as well as in ligands longer than nonamers. A review of the 
positional frequencies observed among the HLA-B15 peptides shows that amino 
acids such as Val, Leu/Ile, Ser, Thr, and Gln/Lys tended to predominate at 
nearly all three of the C-proximal positions of ligands presented by B*1501, 
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B*1503, B*1508, B*1510, and B*1512 (FIGS. 21-25, B). Of these residues, 
the hydrophilic, hydroxyl-containing Ser and Thr were especially frequent 
among these positions; nearly half (40%) of the ligands listed in Tables A-E, 
bear Ser and/or Thr at the designated C-proximal positions. In general, Glu 
occupied C 1 and Gly occupied C 3 to some extent among all five allotypes. 

Val (C 1 ) and Pro (C 2 ) were especially prominent C-proximal residues 
observed among the B*1510 ligands; the overriding presence of Pro, which 
distinguished this region of B*1510-derived peptides from those of the other 
allotypes, can likely be attributed to steric influences imposed by the Tyr at or 2 
position 116 in B*1510, which additionally interacts with the C- and E-pockets 
of the peptide binding groove (FIG- 3). Further distinguishing several B*1510 
ligands from but rare occurrences among B*1501, B*1503, B*1508, and 
B*1512, Pro frequently appeared as well in various C-proximal sequence 
combinations with Aia or Val (Table 8). 

The amino acid residues characterized from each of the five HLA-B15 
allotypes with occupancy rates of at least 10% for the first four (N- 
terminal/proximal) and last four (C-terminal/proximal) positions among ligands, 
respectively, are condensed in Tables 9-13. Presenting the data already 
discussed in this manner effectively emphasizes C-terminal dominance and N- 
proximal flexibility. By comparison, the data illustrates the limitations of pooled 
Edman motifs in being able to adequately reflect a consensus of the individual 
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peptides contained within a given ligand extract- The N sum /C sum quotients 
obtained as described for Tables 9-13 were less than 1-00 in the cases of all 
allotypes, thus providing a more fixed description to the C-terminal/proximal 
region (gray) as a whole with respect to the N-terminal/proximal region (black). 

Among the N-regional position ligand residues occurring at >10%, 
nothing appears to prominently stand out at P3 and P4 although assignments 
were made to these positions via Edman sequencing. The occupancies that 
were observed were not necessarily captured by the motifs; a specific 
illustration of this is Ala (19.51%) at P3 of B*1510 (Table 12 and FIG. 16). This 
trend appeared likewise applicable at the P2 anchor, where with the exceptions 
of B*1508 and B*1510, occupants at this position among > 10% of ligands from 
each of the remaining allotypes included additional side chains (for example, Ala 
at P2 in both B*1501 and B*1512) not accounted for by pooled sequencing. 
The C termini of each allotype are comprised of two amino acid specificities as 
shown by more than 80% of characterized peptides in all cases (Tables 9-13). 
In summary, comparing observed IM- and C-regional occupancies among the 
characterized ligands underscores the flexibility of N-proximal versus the 
dominance of Oterminal preferences among the B*1501, B*1503, 6*1508, 
B*1510, and B*1512 binding grooves. 

A total of 40 specific ligands among the 449 characterized here (Tables 
A-E) overlapped across the peptide binding grooves of B*1501, B*1503, 
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6*1508, and/or B*1512 (Table 14); as previously discussed from the 
information in Table 4, some of the overlapping ligands likewise coincided with 
ligands characterized by others from additional HLA-B15 allotypes including 
B*1502 and B*4601. Length variations among the overlapping ligands 
identified tended to mimic those observed among the entire set of ligands 
characterized, formerly discussed and illustrated in FIG. 18. Only seven 
overlapping ligands were longer than 9 amino acids in length (4 decamers and 
3 undecamers), while 16 fell short of 9 residues long (6 heptamers and 10 
octamers); less than 50% of the successful overlaps were therefore nonameric. 

Throughout the mapping and sequencing approach that was developed 
and executed as outlined earlier (FIG. 6), an extensive comparison was first 
conducted upon ions occupying RP-HPLC fractions 6-19 from separations as 
shown in FIG. 11 of ~400 pg of peptides from each of B*1501, B*1503, 
B*1508, and B*1510, the first four B15 molecules prepared in the course of 
this study. From this, 21 peptide overlaps across B*1508 and B*1501 were 
defined. Similarly, eight ligands overlapping B*1501 and B*1503 were 
identified, and four ligands were found to overlap across B*1508, B*1503, and 
B*1501. A conservative estimate, based upon past examination of B*1501 
ligands, is that the ion maps for each of the B15 allotypes represented at least 
2,000 individual peptides per molecule (Prilliman et al. 1997), yet B*1510 was 
not observed to share ligand overlaps with B*1508, B*i5011, or B*1503 (Table 
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14). 

The sequence data indicates that overlapping ligands bind across 
divergent B*1508, B*1501, and B*1503 binding grooves but not B*1510. This 
pattern likewise accentuates an apparently dominant role for C-terminal 
anchors in natural peptide binding as discussed previously. FIG. 26 depicts, in 
the context of the class I peptide binding cleft, the locations of polymorphisms 
that individuate B*1508, B*1501, 6*1503, and B*1510 and highlights the 
anchoring residues for the peptide overlaps according to the N-proximal and C- 
terminal specificities of their respective presenting molecule's motif (FIGS. 15 
and 16). Bolding the amino acids of these overlapping ligands, which are in 
agreement with the traditional pooled motifs, underscores the trend whereby 
a C-terminal anchor sequence is conserved within overlaps while the N-proximal 
anchor is considerably more flexible in its location and/or sequence. A lack of 
overlaps with B*1510 could potentially be explained by the S-Y substitution 
observed between this allotype and the other three at cc 2 position 116. Thus, 
the conserved C-terminal anchors that facilitate the occurrence of B*1508, 
B*1501, and B*1503 overlaps fail to preferentially interact with the B*1510 C- 
terminal specificity pockets. 

A further example provided here of how C-proximal auxiliary anchors 
might positively impact endogenous ligand binding is that eight of the peptides 
overlapping both the B*1508 and B*1501 antigen binding grooves bear Thr at 
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C 1 , C 2 , or C 3 , and in four cases the peptides that bind B*1508/B*1501 or 
B*1508/B*1501/B*1503are heptamers with Thr occupying P7, their Oterminai 
positions (FIG. 26). The prominent role of Thr as a C-terminal/proximal 
auxiliary anchor Is dramatically illustrated by the B*1508/B*1501/B*15O3 
overlapping heptamer CPLSCFT, where Thr provides a C-terminal anchor for 
this ligand not evident in the pooled motifs of the three allotypes. 

Distilling the data from the overlapping ligands among 6*1501, 6*1503, 
and B*1508 suggests a model for endogenous ligand binding whereby peptides 
are first anchored or held in the class I binding groove by their C termini. In 
order for a given peptide to remain stably fastened in the groove for successful 
trimer assembly and subsequent export from the cell, it is observed that 
following rigid anchoring at the C terminus as described, a ligand must be 
subsequently tethered into the class I antigen binding cleft at a more variably 
defined N-proximal position. Such is the case for peptide ligand NQZHGSAEY, 
a nonamer that overlaps across B*1508, B*1501, and B*1503 (FIG. 24). 
According to this model (FIG. 27), a C-terminal Tyr securely anchors 
NQZHGSAEY into all three B15 allotypes, while a Gin at P2 anchors the peptide 
into B*1501 and B*1503 and a Gln/Lys (most likely a Lys based upon both 
motif assignments and fractional Edman sequencing data) at P3 provides 
additional anchoring for B*1501 and serves as the sole N-proximal anchor for 
B*1508. This model appears clearly applicable to at least 75% of the ligands 
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presented in FIG. 26; for those peptides to which it does not evidently apply, 
the possible anchoring modes remain open to further speculation at the level 
of individual ligands. 

For example, the B*1501/B*1503 overlap AQFASGAGZ (FIG. 26) may 
instead be additively stabilized through the N-proximal anchors indicated at P2 
and P3 as well as at the N-terminal position, since Ala demonstrated significant 
PI occupancy among both B*1501 and B*1503 ligands, as previously shown 
(FIGS. 21 and 22, A). Additionally, the four heptameric overlaps that were 
observed across B*1508/B*1501/B*1503, which terminate in Thr, could lie 
within the peptide binding groove such that they are anchored N- 
terminally/proximally and their C termini interact with the C-proximal regions 
of the groove, which have demonstrated preferences for Thr; these ligands 
might therefore fail to extend into the F-pocket. As compared with C-terminal 
sequences, both length and N-proximal specificity characteristics of ligands 
generally play secondary roles in the natural binding of B15 peptide epitopes. 

Further stemming from the data obtained by comparatively examining 
B*1501, B*1503, 6*1508, and B*1510 ligands, differential interactions with the 
chaperone tapasin specifically influence the loading of peptides into HLA-B15 
molecules. Tapasin, an MHC-encoded chaperone discussed hereinabove 
(Herberg et al. 1998), is a recently-discovered 48 kDa transmembrane 
glycoprotein resident to the ER that directly interacts with both calreticulin and 
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empty a-chain/p 2 m dimers to form a "loading complex" linked to TAP1/TAP2 
(Sadasivan et ai. 1996; Grandea III et al. 1997; Li et al. 1999). Tapasin is not 
a requirement for ligand loading via the typical endogenous processing pathway 
(Lewis et al. 1998; Peh et al. 1998), and aside from its proposed role in serving 
as a bridge between a class I dimer and the peptide transporter until release 
of mature trimers upon peptide binding, the exact role of tapasin during class 
I assembly is unknown (Pamer and Cresswell 1998). Interactions between 
nascent class I molecules and TAP1/TAP2 have, however, been shown to be 
influenced either directly or indirectly by a 3 and positions 116 and 156 of a 2 
(Suh et al. 1999; Kuiig et al. 1998; Neisig et al. 1996). 

Of specific interest, past analysis of divergent HLA-B35 molecules has 
indicated that allotypes bearing an aromatic amino acid (Phe orTyr) at position 
116 interacted with TAP1/TAP2; allotypes bearing a Ser substitution at this site 
failed to demonstrate the interaction (Neisig et al. 1996). Likewise, data 
subsequently obtained for the sHLA transfectants utilized here according to 
established immunoprecipitation protocols (for example, Harris et al. 1998) 
indicates that although all four allotypes associate with calreticulin, B*1501, 
B*1503, and B*1508 do not associate with tapasin (and therefore not with 
TAP1/TAP2) whereas B*1510 does. Membrane-bound forms of B*1501 and 
B*1516 have previously been shown by others to not associate with TAP1/TAP2 
(Neisig et al. 1996; de la Salle et al. 1997), demonstrating that results obtained 
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from the sHLA transfects are in concordance with those of native molecules. 

Though functionally divergent according to its pooled motif (FIG. 16) and 
the majority of peptides that it binds, B*1510 is capable of accommodating 
ligands with the properties favored by the B*1501, 6*1503, and B*1508 
binding grooves. Data both from individual ligands (Table D) and fractional 
Edman sequencing (Table 3) indicate that Tyr can occupy the C-terminal 
position, and specific examples in Table 4, including the spleen mitotic 
checkpoint BUB3 53 _ 60 octamer YQHTGAVL and the splicing factor U2AF large 
chain 179 . 187 nonamer TQAPGNPVL, attest to B-pocket flexibility. It is intriguing 
that among the peptides bound by B*1510 is the tapasin 354.352 nonamer 
HHSDGSVSL; the peptide appears to occupy ligand extracts in a high copy 
number, as qualitatively based upon relative mass spectrometricion intensities. 
Given this, as well as considering potential models of loading complex 
interactions suggested by others (IMeisig et al. 1996; Elliott 1997), it can be 
extrapolated that a portion of tapasin, analogous to class II-associated invariant 
chain-derived peptides (Riberdy et al. 1992; Sette et al. 1992), extends into 
and blocks a region of the empty class I binding groove until it is displaced by 
an optimally-fitting ligand and/or secondary chaperone; this could also account 
for the differences in overall P2 flexibility observed between B*1510 peptides 
and those of the other three allotypes. Participating in ligand selection by this 
mechanism would describe a distinct peptide editing role for tapasin and could 
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clarify the inability to detect overlaps between B*1510 and either B*1501, 
B*1503, or B*1508. 

In addition to the initial search for overlaps across B*1501, B*1503, 
B*1508, and B*1510, a comparative analysis was performed between the ion 
maps of B*1501 and B*1512. As discussed previously hereinabove, such an 
examination is primarily important in revealing the presence of ligands bound 
by B*1512 but not B*1501 (Table 5). A number of overlapping ligands from 
6*1512, however, were additionally identified (Table 14). Conservative 
percentages of overlap subsequently observed among each of the four 
molecules from which ligands were characterized and the ancestral HLA-B15 
allotype, B*1501, were determined as shown and described in Table 15. 

As expected from an overview of their nearly identical ion maps, B*1501 
and B*1512 demonstrated the highest overlap frequency between the allotypes 
at 70% among ions subjected to NanoES-MS/MS. After this, B*1503 and 
B*1508 respectively exhibited 14% and 9% overlap frequencies, while as 
shown earlier B*1510 completely failed to reveal overlaps with B*1501. The 
trend distinctly illustrates that the polymorphisms which distinguish the 
B*1503, B*1508, B*1510, and B*1512 peptide binding grooves from B*1501 
are not functionally equivalent in terms of their impacts upon class I ligand 
association. However, it is also evident that they do not create concrete 
barriers to ligand binding. Because the repertoires of peptides bound by the 
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various molecules examined may differ from B*1501 at frequencies greater 
than 80% (B*1508 and B*1503, Table 15) does not mean that they are unable 
to bind similar or completely identical peptides, a concept which has been 
incompletely addressed and occassionally negated by other studies grounded 
more upon pooled Edman sequencing, analysis of prominent extract 
constituents, or binding/reconstitution assays. 

In particular and based upon previous research, the overlaps between 
B*1508 and B*1501 defined here would specifically not have been predicted 
(Barber etal. 1997); it may be anticipated by extension that other molecules 
differing solely by the polymorphism separating B*1508 and B*1501, for 
example B*1503 and B*1529 (Appendix A), yield similar overlap frequencies. 
Likewise, based upon the relatively high overlap frequency observed between 
B*1512 and B*1501, the substitutions at positions 166 and 167 distinguishing 
them exhibit a similarly subtle effect between A*2902 and A*2903 (which only 
differ from one another by the identical substitutions studied here; Prokupek et 
al. 1997) via mapping and sequencing of individual ligands. Systematically 
attempting to define the limits of overlap existence as conducted, therefore, 
demonstrates a critical departure from standard approaches which enhance 
predicting the abilities of different class I molecules to present overlapping 
ligands. 

Comparative analyses of closely related soluble MHC class I molecules 
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produced by the recombinant methods described herein, provide a means for 
assessing the functional impact of individual cc-chain polymorphisms. The 
primary impetus for characterizing peptides extracted from class I molecules 
is to more precisely understand the influence of structural polymorphism upon 
the presentation of endogenous ligands. This is important since a fundamental 
realization of how naturally processed peptides bind to both individual and 
multiple class I allotypes can then be translated into protein and/or peptide- 
based therapies intended to elicit protective CTL responses. Therefore, an 
accurate interpretation of sequence data from such class I-bound peptides, 
either individual or pooled, should in turn further the selection of optimal viral 
and tumor-associated ligands to expedite the development of successful 
therapeutic applications. 

The extensive examination of HLA-B15 ligands, as described herein, 
enhances understanding the rules that govern natural class I peptide 
presentation and is secondary evidence of the success and usefulness of the 
methodology for producing soluble MHC class I and II molecules described and 
claimed herein. By first building upon the traditional foundations provided by 
pooled Edman motifs (FIGS. 15 and 16), the data from over 400 individual 
peptides characterized from B*1501, B*1503, B*1508, B*1510, and B*1512 
subsequently indicated that queries for potential epitopes specific to these 
allotypes would benefit from being optimized in three ways. First, although 
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nonamers represent half of the ligand population, the other 50% of peptide 
epitopes range down to 7 and up to 12 amino acids in length. Second, effective 
N-proximal anchor requirements need not be strictly imposed at P2. Third, 
searches for ligands should weigh C-terminal/proximal sequence matches even 
more heavily than those of the N-region. The third trend revealed represent, 
the most substantial of the revised search criteria, since both length variations 
and lower sensitivity due to the diminishing returns and increasing backgrounds 
inherent to successive Edman sequencing cycles can C-regional motif trends 
(Stevanovic and Jung 1993). As stressed earlier, examples of this bias were 
evident here in both: (i) the stronger preference by B*1510 for peptides 
terminating in Val (absent from the pooled motif) rather than Phe (present in 
the pooled motif) (FIGS. 16 and 24B); and (ii) the inability of motifs to 
effectively reflect C-proximal auxiliary anchors (FIGS. 15; 16; and 21-25, B). 

To illustrate the potential consequences of applying the modified search 
parameters described, the EBV structural antigen gp85, which has recently 
been implicated using a murine model as a favorable target against which 
protective CTLs might be generated (Khanna et al. 1999), was examined in the 
context of B*1501 to identify: (i) nonameric epitopes with motif-prescribed P2 
and P9 occupancies; (ii) length variant epitopes with motif-prescribed P2 and 
P9 occupancies; and (iii) nonameric epitopes with flexible P2 occupancy (Table 
16). Since only these three categories of ligands were designated, the inquiry 
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was not exhaustive. However, the information extracted showed that, of the 
98 possible epitopes identified, only the 22% within the first column would be 
placed under further experimental consideration if pooled motifs alone were 
applied in the search. This is not to imply that the data in this category is 
invalid but that it might be considerably incomplete for later applications. For 
example, if either of the AMTS KFLM GTY 172 . 182 (varying by length) or the 
SAPLEKQLF 123 . 131 (varying by P2 occupancy) peptides was demonstrated to elicit 
a more effective antigen-specific CTL response than any of the nonamers 
bearing standard motif P2/P9 assignments, this knowledge is pivotal to 
subsequent vaccine design; even if the two designated peptides evoked 
responses only equivalent to some of the nonamers, their non-motif length 
and/or sequences discrepancies could prove superior in conferring the ability 
to overlap multiple allotypes in addition to B*1501. This is advantageous since 
a vaccine consisting of a single or limited number of peptide specificities could 
theoretically be effective for protecting populations differing in HLA type (Loftus 
et al. 1995; Gulukqta et al. 1996; Sidney et al. 1996c). 

The majority of information collected from examining divergent HLA-B15 
allotypes of sHLA molecules recombinantly produced according to the 
methodology of the present invention demonstrates that similar and 
occassionally identical peptide ligands are presented by the different B15 
molecules so long as polymorphisms do not alter C-terminal anchoring pockets 
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and while an N-proximal Hgand residue can be subsequently anchored within 
the binding groove. Supporting data furthermore indicates that these principles 
additionally extend beyond the HLA-B15 allotypes described in specificity 
herein. Specifically, unpublished results by Ghosh and Wiley (noted in Bouvier 
and Wiley 1994) indicate that an octamer has been observed to successfully 
bind a class I molecule by its C terminus despite being shown through x-ray 
crystallography to not even reach the N-terminal pocket of the binding groove. 
In addition, a recently-described HIV-gag 19(7 . 205 crL epitope presented by murine 
class I K d fails to show a motif- prescribed Tyr at P2 and instead associates 
stably through its conserved C terminus and an N-proximal preference for Gin 
at P3 (Mata et al. 1998). The rules established here through examination of 
hundreds of natural ligands from B*1501, B*1503, B*1508, B*1510, and 
B*1512 indicate that such occurrences may be more commonplace than 
exception, as both of these examples appear in agreement with the model in 
FIG. 27. 

A step in developing therapies intended to elicit protective CTLs requires 
the selection of pathogen- and tumor-specific peptide ligands for presentation 
by MHC class I and class II molecules. Binding/reconstitution assays provide 
information that is biased due to their technical inconsistency and/or in vitro 
nature, while Edman sequencing of extracted class I peptide pools generates 
"motifs" that indicate that the optimal peptides are nonameric ligands bearing 
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conserved P2 and P9 anchors; motifs have frequently been used to provide the 
search parameters for selecting potentially immunogenic epitopes that might 
be successfully presented by particular allotypes (Pamer et al. 1991; DiBrino 
etal. 1994; Kast et al. 1994; Davenport et al. 1995; Walden 1996; Holland et 
al. 1997; Zhang et al. 1997; Yoon et al. 1998; Chang et al. 1999). Therefore, 
to test the hypothesis that natural presentation overlaps exist despite the 
presence of various polymorphisms within the class I binding groove and thus 
determine how well pooled motifs actually represent their endogenously-derived 
constituents, ligands were purified from different sHLA molecules produced in 
hollow-fiber bioreactors, mapped by RP-HPLC and NanoES-MS, and sequenced 
by NanoES-MS/MS, all according to the methodology of the present invention. 

Production of sHLA* provides an efficient means of extracting large 
quantities of endogenous peptide ligands for the subsequent analyses, and 
comparative ion mapping of peptides extracted from distinct class I allotypes 
is a reliable method for detecting potential ligand overlaps. NanoES-MS/MS 
analysis then allows for sequence characterization to identify the overlap status 
of individual ion matches. The strategy developed to address overlap 
identification is additionally pertinent beyond the uses described herein. For 
example, similar mapping studies would be performed, with the primary intent 
instead of characterizing differences between maps, such as between 
pathogenically infected versus uninfected cell lines; the data obtained could 
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contribute to identifying optimal vaccine epitopes. This concept is currently in 
the early stages of pursuit by several independent research teams (Veronese 
et al. 1996; van der Heeft et al. 1998). Successfully locating differences in a 
similar manner between B*1501 and B*1512 ion maps, as discussed herein, 
effectively supports this application. 

FIG. 28 summarizes the a-chain substitutions and motif-derived P2 and 
P9 anchors for the HLA-B15 allotypes examined here, as well as additional 
allotypes indicated earlier in FIG. 5 that appear to serve as evolutionary 
intermediates between or extensions from B*1503, B*1510, B*1512, and 
B*1518. B*4601 is included since it differs from B*1501 by a single mutagenic 
event and overlaps with it were among peptides characterized in this study 
(Table 4). In section A, the allotypes B*1501, B*1503, B*1508, B*1512, and 
B*4601 have been shown to bind variously overlapping ligands; based upon the 
structurally-predicted motif anchors of the remaining molecules in this section 
including B*1519, B*1529, B*1539, and B*1547, it is suspected that ion 
mapping and characterization would reveal further overlaps according to the 
model shown in FIG. 27. Likewise, in section C the allotypes B*1509 and 
B*1510 have demonstrated overlapping ligands and will probably share some 
with B*1537. 

Systematically mapping and characterizing 449 ligands from the related 
molecules B*1501, B*1503, B*1508, B*1510, and B*1512 demonstrates 
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overall that the peptides bound by these allotypes: (i) vary in length from 7 to 
12 residues; and (ii) are more conserved at their C termini that at their N- 
proximal positions. Flexibility at P2 in particular appears to arise at least in part 
from the combined effects of distinct steric and charge biases imposed 
respectively by oc-helical and (3-sheet structural residues throughout a x and a 2 
of the various HLA-B15 molecules, while it is postulated that C-terminal 
preferences are influenced by tapasin-moderated loading selection within the 
ER. 

Although not predictable from the pooled Edman motifs, the comparative 
peptide mapping strategy succeeded in identifying endogenously processed 
ligands which bind variously across the allotypes B*1501, 6*1503, B*1508, and 
B*1512, but not B*1510. Overlapping peptide ligands appeared to favorably 
bind the first four B15 molecules since these allotypes share identical C- 
terminal anchoring pockets, whereas B*1510 diverges in this region. 
Endogenous peptide loading into the HLA-B15 allotypes therefore requires that 
a conserved C terminus be firmly anchored in the appropriate specificity pocket 
while N-proximal residues act more flexibly in terms of both location and 
sequence specificity to anchor the ligand into this binding groove region. 
Subsequently, the choice of allele-specific and/or overlapping peptide epitopes 
for CTL recognition may thus be contingent upon performing queries strongly 
based upon conserved C-terminal anchors. 
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Another embodiment of the present invention, as previously described 
herein-above, is the use of genomic DNA (gDNA) as the starting material for 
the production of the sHLA molecules described hereinbefore. 

This alternative method of the present invention begins by obtaining 
genomic DNA which encodes the desired MHC class I or class II molecule. 
Alleles at the locus which encode the desired MHC molecule are PCR amplified 
in a locus specific manner. These locus specific PCR products may include the 
entire coding region of the MHC molecule or a portion thereof. In some cases 
a nested or heml-nested PCR is applied to produce a truncated form of the class 
I or class II gene so that it will be secreted rather than anchored to the cell 
surface. In other cases the PCR will directly truncate the MHC molecule. 

Locus specific PCR products are cloned into a mammalian expression 
vector and screened with a variety of methods to identify a clone encoding the 
desired MHC molecule. The cloned MHC molecules are DNA sequenced to 
insure fidelity of the PCR. Faithful truncated (i.e. sHLA) clones of the desired 
MHC molecule are then transfected into a mammalian cell line. When such cell 
line is transfected with a vector encoding a recombinant class I molecule, such 
cell line may either lack endogenous class I expression or express endogenous 
class I. It is important to note that cells expressing endogenous class I may 
spontaneously release MHC into solution upon natural cell death. In cases 
where this small amount of spontaneously released MHC is a concern, the 
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transfected class I MHC molecule can be "tagged" such that it can be specifically 
purified away from spontaneously released endogenous class I molecules in 
cells that express class I molecules. For example, a DNA fragment encoding a 
His tail which will be attached to the protein may be added by the PCR reaction 
or may be encoded by the vector into which the gDNA fragment is cloned, and 
such His tail will further aid in purification of the class I molecules away from 
endogenous class I molecules. Tags beside a histidine tail have also been 
demonstrated to work and are logical to those skilled in the art of tagging 
proteins for downstream purification. 

Cloned genomic DNA fragments contain both exons and introns as well 
as other non-translated regions at the 5' and 3' termini of the gene. Following 
transfection into a cell line which transcribes the genomic DNA (gDNA) into 
RNA, cloned genomic DNA results in a protein product thereby removing introns 
and splicing the RNA to form messenger RNA (mRNA), which is then translated 
into an MHC protein. Transfection of MHC molecules encoded by gDNA 
therefore facilitates reisolation of the gDNA, mRNA/cDNA, and protein. 

Production of MHC molecules in non-mammalian cell lines such as insect 
and bacterial cells requires cDNA clones, as these lower cell types do not have 
the ability to splice introns out of RNA transcribed from a gDNA clone. In these 
instances the mammalian gDNA transfectants of the present invention provide 
a valuable source of RNA which can be reverse transcribed to form MHC cDNA. 
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The cDNA can then be cloned, transferred into cells, and then. translated into 
protein. In addition to producing secreted MHC, such gDNA transfectants 
therefore provide a ready source of mRNA, and therefore cDNA clones, which 
can then be transfected into non-mammalian cells for production of MHC. Thus, 
the present invention which starts with MHC genomic DNA clones allows for the 
production of MHC in cells from various species. 

A key advantage of starting from gDNA is that viable cells containing the 
MHC molecule of interest are not needed. Since all individuals in the population 
have a different MHC repertoire, one would need to search more than 500,000 
individuals to find someone with the same MHC complement as a desired 
individual - this is observed when trying to find a match for bone marrow 
transplantation. Thus, if it is desired to produce a particular MHC molecule for 
use in an experiment or diagnostic, a person or cell expressing the MHC allele 
of interest would first need to be identified. Alternatively, in the method of the 
present invention, only a saliva sample, a hair root, an old freezer sample, or 
less than a milliliter (0.2 ml) of blood would be required to isolate the gDNA. 
Then, starting from gDNA, the MHC molecule of interest could be obtained via 
a gDNA clone as described herein, and following transfection of such clone into 
mammalian cells, the desired protein could be produced directly or in 
mammalian cells or from cDNA in several species of cells using the methods of 
the present invention described herein. 
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Current experiments to obtain an MHC allele for protein expression 
typically start from mRNA, which requires a fresh sample of mammalian cells 
that express the MHC molecule of interest. Working from gDNA does not 
require gene expression or a fresh biological sample. It is also important to 
note that RNA is inherently unstable and is not easily obtained as is gDNA. 
Therefore, if production of a particular MHC molecule starting from a cDNA 
clone is desired, a person or cell line that is expressing the allele of interest 
must traditionally first be identified in order to obtain RNA. Then a fresh 
sample of blood or cells must be obtained; experiments using the methodology 
of the present invention show that >.5 milliliters of blood that is less than 3 days 
old is required to obtain sufficient RNA for MHC cDNA synthesis. Thus, by 
starting with gDNA, the breath of MHC molecules that can be readily produced 
is expanded. This is a key factor in a system as polymorphic as the MHC 
system; hundreds of MHC molecules exist, and not all MHC molecules are 
readily available from MRNA. This is especially true of MHC molecules unique 
to isolated populations or of MHC molecules unique to ethnic minorities. 
Starting class I or class II protein expression from the point of genomic DNA 
simplifies the isolation of the gene of interest and insures a more equitable 
means of producing MHC molecules for study; otherwise, one would be left to 
determine whose MHC molecules are chosen and not chosen for study, as well 
as to determine which ethnic population from which fresh samples cannot be 
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obtained should not have their MHC molecules included in a diagnostic assay. 

While cDNA may be substituted for genomic DNA as the starting material, 
production of cDNA for each of the desired HLA class I types will require 
hundreds of different, HLA typed, viable cell lines, each expressing a different 
HLA class I type. Alternatively, fresh samples are required from individuals with 
the various desired MHC types. The use of genomic DNA as the starting 
material allows for the production of clones for many HLA molecules from a 
single genomic DNA sequence, as the amplification process can be manipulated 
to mimic recombinatorial and gene conversion events. Several mutagenesis 
strategies exist whereby a given class I gDNA clone could be modified at either 
the level of gDNA or at the cDNA resulting from this gDNA clone. The process 
of the present invention does not require viable cells, and therefore the 
degradation which plagues RNA is not a problem. Thus, from a given gDNA 
clone, any number of gDNA and cDNA MHC molecules can be produced. 

Three useful products can be obtained from the mammalian cell line 
expressing HLA class I molecules from such a genomic DNA construct. The first 
product is the soluble class I MHC protein, which may be purified and utilized 
in various experimental strategies, including but not limited to epitope testing. 
Epitope testing is a method for determining how well discovered or putative 
peptide epitopes bind individual, specific class I or class II MHC proteins. 
Epitope testing with secreted individual MHC molecules has several advantages 
over the prior art, which utilized MHC from cells expressing multiple membrane- 
bound MHCs. While the prior art method could distinguish if a cell or cell lysate 
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would recognize an epitope, such method was unable to directly distinguish in 
which specific MHC molecule the peptide epitope was bound. Lengthy 
purification processes might be used to try and obtain a single MHC molecule, 
but doing so limits the quantity and usefulness of the protein obtained. The 
novelty of the current approach is that individual MHC specificities can be 
utilized in sufficient quantity through the use of recombinant, soluble MHC 
proteins. Because MHC molecules participate in numerous immune responses, 
studies of vaccines, transplantation, immune tolerance, and autoimmunity can 
all benefit from individual MHC molecules provided in sufficient quantity. 

A second important product obtained from mammalian cells secreting 
individual MHC molecules is the peptide cargo carried by MHC molecules. Class 
I and class II MHC molecules are really a trimolecular complex consisting of an 
alpha chain, a beta chain, and the alpha/beta chain's peptide cargo to be 
reviewed by immune effector cells. Since it is the peptide cargo, and not the 
MHC alpha and beta chains, which marks a cell as infected, tumorigenic, or 
diseased, there is a great need to characterize the peptides bound by particular 
MHC molecules. For example, characterization of such peptides will greatly aid 
in determining how the peptides presented by a person with MHC-associated 
diabetes differ from the peptides presented by the MHC molecules associated 
with resistance to diabetes. As stated above, having a sufficient supply of an 
individual MHC molecule, and therefore that MHC molecules bound peptides, 
provides a means for studying such diseases. Because the method of the 
present invention provides quantities of MHC protein previously unobtainable, 
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unparalleled studies of MHC molecules and their important peptide cargo can 
now be facilitated. 

The methodology for producing sHLA from gDNA, while similar to the 
methodology for producing sHLA from cDNA, is different and as such requires 
different and/or unique steps and/or processes for its completion. One 
exemplary detailed production methodology for use gDNA as the starting 
material for the production of MHC class I or II molecules is described herein 
below. 

- Genomic DNA Extraction 

Greater than or equal to 200ul of sample either blood, plasma, serum, 
buffy coat, body fluid or up to 5xl0 6 lymphocytes in 200ul phosphate buffered 
saline was used to extract genomic DNA using the QIAampr DNA Blood Mini Kit 
blood and body fluid spin protocol. Genomic DNA quality and quantity was 
assessed using optical density readings at 260nm and 280nm 

PCR Strategy 

Primers have been designed for HLA-A, -B and -C loci in order to produce 
a truncated amplicon of the human class I MHC using a two-stage PCR strategy. 
The first stage PCR uses a primer set that amplify from the 5' Untranslated 
region to Intron 4. This amplicon is used as a template for the second PCR 
which results in a truncated version of the MHC Class I gene by utilizing a 3' 
primer that sits down in exon 4 at codon 298 (including the leader peptide), the 
5' primer remains the same as the 1 st PCR. An overview of this PCR strategy 
can be seen in FIG. 29. The primers for each locus are listed in Table 17. 
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Different HLA-B locus alleles require primers with different restriction cut sites 
depending on the nucleotide sequence of the allele. Hence there are two 5' and 
two 3' truncating primers for the HLA-B locus. 
1. Primary PCR Materials 

An Eppendorf Gradient Mastercycler is used for all PCR. H 2 0: Dionized 
ultra filtered water (DIUF) Fisher Scientific, W2-4,41. PCR nt mix (10mA/ each 
deoxyribonucleoside triphosphate [dNTP]), Boehringer Manheim, #1814, 362. 
10X Pfx Amplification buffer, pH 9.0, GibcoBRLR, part # 52806, formulation is 
proprietary information. 50mM MgS0 4 , GibcoBRLR, part #52044. Platinuma" 
Pfx DNA Polymerase (B Locus only), GibcoBRLR, 11708-013. Pfu DNA 
Polymerase (A and C Locus), Promega, M7741. Pfu DNA Polymerase lOx 
reaction Buffer with MgS0 4 , 200mA/ Tris-HCL, pH 8.8,100mM KCI, 100mA/ 
(NH 4 ) 2 S0 4 , 20mA/ MgS0 4 , lmg/ml nuclease free BSA,1% TritonrX-100. gDNA 
Template. Amplification primers (in ng/ul): 

a. A locus: 5' sense PP5UTA (300); 3'antisense PPI4A (300) 

b. B locus (Not B*39's): sense PP5UTB (300); antisense PPI4B (300) 

c. B locus (B*39's): sense 5UTB39 (300); antisense PPI4B (300) 

d. C Locus: sense 5PKCE (300); antisense PPI4C (300). 

B. Secondary PCR (also used for colony PCR) 

H z O: Dionized ultra filtered water (DIUF) Fisher Scientific, W2-4,41. PCR 
nt mix (10mA/ each deoxyribonucleoside triphosphate [dNTP]), Boehringer 
Manheim, #1814, 362. Pfu DNA Polymerase, Promega, M7741. Pfu DNA 
Polymerase lOx reaction Buffer with MgS0 4 , 200mA/ Tris-HCL,pH 8.8,100mM 
KCI, 100mA/ (NH 4 ) 2 S0 4 , 20mA/ MgS0 4 , lmg/ml nuclease free BSA,1% TritonrX- 
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100. Template 1:100 dilution of the primary PCR product. Amplification 
primers (in ng/ul): 

a. A-locus: 5' sense PP5UTA (300), 3' antisense PP3PEI (300) 

b. B-locus: sense PP5UTB (300), antisense PP3PEI (300) 

c. B-locus: sense PP5UT (300), antisense PP3PEIH (300) 

d. B-locus B39's: sense 5UTB39 (300), antisense PP3PEIH (300) 

e. C-locus: sense 5PKCE (300), antisense PP3PEI (300) 

f. C-locus Cw*7's; sense 5PKCE (300), antisense 3PEIHC7 (300) 

C. Gel Purification of PCR products and vectors 

Dark Reader Transilluminator Model DR-45M, Clare Chemical Research. 
SYBR Green, Molecular Probes Inc. Quantum Prep Freeze 'N Squeeze DNA Gel 
Extraction Spin Columns, Bio-Rad Laboratories, 732-6165. 

D. Restriction digests, Ligation and Transformation 

^Restriction enzymes from New England Biolabs: EcoR I #R0101S; Hind 
III #R0104S; Xba I #R0145S. T4 DNA Ligase, New England Biolabs, 
#M0202S. pcDNA3.1(-), Invitrogen Corporation, V795-20. lOx Buffers from 
New England Biolabs. EcoR I buffer, 500mM NaCI, lOOOmM Tris-HCL, lOmM 
MgCL 2 , 0.25% Triton-X 100, pH 7.5. T4 DNA ligase buffer, 500mM Tris-HCL, 
lOOmM MgCL 2/ lOOmM DTT, lOmM ATP, 250ug/ml BSA, pH 7.5. NEB buffer 2, 
500mM NaCI, 100 mM Tris-HCI, lOOmM MgCI 2 , lOmM DDT, pH 7.9. lOOx BSA, 
New England Biolabs. Z-Competent E. coli Transformation Buffer Set, Zymo 
Research, T3002. E. coli strain JM109. LB Plates with lOOug/ml ampicillin. LB 
media with lOOug/ml ampicillin 

E. Plasmid Extraction 



-107- 



WO 02/062846 

Wizard Plus SV minipreps, Promega, #A1460 
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3. Sequencing of Clones 

Thermo Sequenase Primer Cycle Sequencing Kit, Amersham Pharmacia 
Biotech, 25-2538-01. CY5 labeled primers (Table 18). Alfexpress automated 
DMA sequencer, Amersham Pharmacia Biotech. 

4. Gel Casting 

PagePIus 40% concentrate, Amresco, E562, 500ml. Urea, Amersham 
Pharmacia Biotech, 17-0889-01,500g. N'N'N'N'-tetramethylethyleneiamine 
(TEMED), APB. Ammonium persulphate (10% solution), APB. Boric acid,APB. 
EDTA-disodium salt, APB. Tris, APB. Bind-Saline, APB. Isopropanol, Sigma. 
Glacial Acetic Acid, Fisher Biotech. DIUF water, Fisher Scientific. Ethanol 200- 
proof. 

5. Plasmid Preparation for Electroporation 
Qiagen Plasmid Midi kit, Qiagen Inc., 12143. 
A. Electroporation 

Biorad Gene Pulser with capacitance extender, Bio-Rad Laboratories. 
Gene Pulser Cuvette, Bio-Rad Laboratories. Cytomix: 120mM KCI, 0.15mM 
CaCI 2 , 10mMK 2 HP(VKH 2 PO 4 , pH 7.6, 25mM Hepes,pH 7.6, 2mM EGTA, pH 7.6, 
5mM MgCI 2 , pH 7.6 with KOH. RPMI 1640+ 20% Foetal Calf Serum + 
Pen/strep. Haemacytometer. Light Microscope. C0 2 37° Incubator. Cells in 
log phase. 
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Methods used for the production of soluble human HLA Class I and II 
proteins in mammalian cells from gDNA. 
1. Primary PCR 

A. A-Locus and C-Locus 



lOx Pfu buffer 5ul 

5' Primer (300ng/ul) lul 

3' Primer (300ng/ul) lul 

dNTP's (lOmM each) lul 

gDNA (»50ng/ul) lOul 

DIUF H 2 0 31.4ul 

Pfu DNA Polymerase 0.6ul 

96°C 2 min. 

95°C 1 min x35 

58°C 1 min x35 

73°C 5 min x35 
73°C 10 min 

B. B-locus 

lOx Pfx buffer 5ul 

5' Primer (300ng/ul) lul 

3' Primer (300ng/ul) lul 

dNTP's (lOmM each) 1.5ul 

MgS0 4 (50mM) lul 

gDNA (lOOng/ul) lul 

DIUF H 2 0 40ul 

Pfx DNA Polymerase 0.5ul 
94°C 2 min. 

94°C 1 min x35 

60°C 1 min x35 

68°C 3.5 min x35 
68°C 5 min 



2. Gel Purification of PCR (all PCR and plasmids are gel purified) 

Mix primary PCR with 5ul of lOx SYBR green and incubate at room 
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temperature for 15 minutes then load on a 2% agarose gel. Visualize on the 
Dark Reader and purify using the Quantum Prep Freeze and Squeeze extraction 
kit according to the manufacturers instructions. 
3. Secondary PCR of A, B and C Loci 



lOx Pfu buffer 5ul 

5' Primer (300ng/ul) l.Oul 

3' Primer (300ng/ul) l.Oul 

dNTP's (lOmM each) lul 

1:100 1° PCR lOul 

DIUF H 2 0 31.4ul 

Pfu DNA Polymerase 0.6ul 
96°C 2 min. 



95°C 1 min x35 

56°C 1 min x35 

73°C 4 min x35 
73°C 7 min 

4. Restriction digests 



2° PCR (gel purified) 30ul 

Restriction enzyme 1 Xul 

Restriction enzyme 2 Xul 

lOx buffer 5ul 

lOOx BSA 0.5ul 

DIUF H 2 0 10.5ul 



The cut sites incorporated into the PCR primers for each individual PCR 
will determine the enzymes used. The expression vector pcDI\IA3.1(-) will be cut 
in a similar manner. 
5. Ligation 



pcDNA3.1(-) cut with same enzymes as PCR x ng 

Cut PCR y ng 

lOx T4 DNA figase buffer 2ul 

T4 DNA Ligase lul 

DIUF H 2 0 up to 20ul 



-110- 



WO 02/062846 PCT/USO 1/49744 

The ratio of vector to insert will vary between samples, a good starting 
point is a ratio of 1:6 

6. Transformation 

Transform JM109 using competent cells made using Z-competent E.coM 
Transformation Kit and Buffer Set. 

7. Colony PCR and restriction digests. 

Either following the secondary PCR protocol or carrying out another 
restriction digest can be used to screen the vector for the appropriate insert. 

8. Mini Preps of colonies with insert 

Use the Wizard Pius SV minipreps and follow the manufacturers 
instructions. Make glycerol stocks before beginning extraction protocol. 

9. Sequencing of positive clones and gel casting 

Using the Thermo Sequenase Primer Cycle Sequencing Kit 



A,C,G or T mix 3ul 

CY5 Primer lpm/ul lul 

DNA template lOOng/ul 5ul 
96°C 2 min 

96°C 30 sec x25 

61°C 30 sec x25 



Add 6 ul formamide loading buffer and load lOul onto sequencing gel 
Analyse sequence for good clones with no misincorporations. 
10. Gel Casting 

A. Prepare a 10XTBE stock solution for the sequencing gel mix: 

500mL 
Tris 60. 5g 



-111- 



WO 02/062846 PCT/USO 1/49744 

EDTA 1.85'g 
Boric Acid 25. 5g 

Fisher DIUF H z O 440mL 

Filter using a 0.22m or 0.45m filter and store at 4°C until required. 

11. Prepare a 10X TBE stock solution for the running buffer: 

1L 

Tris 121. Og 

EDTA 3.7g 
Boric Acid 51. Og 

Ultra pure H 2 0 880 mL 

12. Prepare the sequencing gel mix: 

1 Gel. 6% PaaePlus 
Urea 19.8g 
PagePlus 40% cone. 7.95mL 
10X TBE 5.5mL 
Fisher DIUF H 2 0 25.3mL 
Filter using a 0.22m or 0.45m filter. 

Initiate polymerization of the sequencing gel by adding 330uLof a freshly made 

10% APS solution and 33uL of TEMED. Cast a 0.5mm sequencing gel and allow 

it to polymerize for 5 hours. 

13. Midi Preps 

Prepare plasmid for electroporation using the Qiagen Plasmid Midi Kit 
according to the manufacturers instructions. 

14. Electroporation 

Electroporations are performed as described in "The Bw4 public epitope 
of HLA-B molecules confers reactivity with natural killer cell clones that express 
NKbl, a putative HLA receptor. Gumperz, J.E., V. Litwin, J.H. Phillips, L.L. 
Lanier and P. Parham. J. Exp. Med. 181:1133-1144, 1995," which is herein 
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expressly incorporated by reference in its entirety. 
15. Screening for production of Soluble HLA 

An ELISA Is used to screen for the production of soluble HLA, see ELISA 
protocol. 
ELISA Protocol 

Solutions: 
3 N H 7 S(X,: 

For 500 ml: 200 ml H 2 0 

300 ml 5 N H 2 S0 4 

Store at room-temperature 
lOx PBS fpH 7.41: 



11.5 g Na 2 HP0 4 

Add H 2 0 

Adjust pH to 7.4 

Add up with H 2 0 to 1000 ml 

Filter 

Store at 4°C 



26.8 mM 
14.7 mM 
1.37 M 
81 mM 



KCI 

KH 2 P0 4 

NaCI 

Na 2 HP0 4 



For 1000 ml: 



2g 
2g 
80 g 



KCI 

KH 2 P0 4 anhydrous 
NaCI 



TBS coating buffer foH 8.51; 



25 mM 
150 mM 



Tris-HCI 
NaCI 



pH 8.5 
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For 1000 ml: 



100 ml 
900 ml 



lOx 



TBS 
H 2 0 



pH 8.5 



Store at 4° C 



IP % BSA 

For 250 ml: 



25 g 



BSA 



Add H z O 

Stir to dissolve 

Add H z O up to 225 ml 

Filter 

Store at 4° C 

(Add lOx PBS prior to use) 



ELISA WASH 



2.68 mM 
1.47 mM 
137 mM 
8.1 mM 
0.05 % 



KCI 

KH 2 P0 4 
NaCI 
Na 2 HP0 4 
Tween-20 



For 4000 ml: 



400 ml 
3598 ml 
2 ml 



lOx PBS 
H 2 0 
Tween-20 



Store at 4°C 
OPD Substrate Solution for HRP: 



For 10 ml: 



0.05 M 
0.025 M 
0.4 mg/ml 
0.012 % 

10 ml 
2 tablets 
4 |Jl 30 % 



Na 2 HP0 4 pH 5.0 

Citrate 

OPD 

H 2 0 2 



0.05 M Phosphate-citrate buffer pH 5.0 

OPD (2 mg ea) (Sigma; P-6787) 

Hydrogen peroxide (H 2 0 2 ) (Sigma 
H1009) 
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Always prepare fresh 

Use within 1 hour of preparation 

Add fresh 30 % H 2 0 2 immediately prior to use 



0,05 M Phosphate-Citrate buffer: 



0.05 M 
0.025 M 



Na 2 HP0 4 pH 5.0 
Citrate 



For 1000 ml: 



7.10 g 
5.25 g 



Na 2 HP0 4 anhydrous (Dibasic) 
Citric acid monohydrate 



Add ddH 2 0 

Adjust pH to 5.0, if necessary 

Add ddH 2 0 to a final volume of 1000 ml 

Filter 

Store at room temperature 



The HLA ELISA procedure 

For biochemical analysis, monomorphic monoclonal antibodies are 
particularly useful for identification of HLA locus products and their subtypes 
W6/32 is one of the most common monoclonal antibodies (mAb) used to 
characterize human class I major histocompatibility complex (MHC) molecules. 
It is directed against monomorphic determinants on HLA-A, -B and -C heavy 
chains, which recognizes only mature complexed class I molecules and 
recognizes a conformational epitope on the intact MHC molecule containing both 
beta2-microglobulin (b2m) and the heavy chain (HC). W6/32 binds a compact 
epitope on the class I molecule that includes both residue 3 of beta2m and 
residue 121 of the heavy chain (Ladasky Jj, Shum BP, Canavez F, Seuanez HN, 
Parham P. Residue 3 of beta2-microglobulin affects binding of class I MHC 
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molecules by the W6/32 antibody. Immunogenetlcs 1999 Apr;49(4):312-20.). 
The constant portion of the molecule W6/32 binds to is recognized by CTLs and 
thus can inhibit cytotoxicity. The reactivity of W6/32 is sensitive to the amino 
terminus of human beta2-microg!obulin (Shields MJ, Ribaudo RK. Mapping of 
the monoclonal antibody W6/32: sensitivity to the amino terminus of beta2- 
microglobulin. Tissue Antigens 1998 May;51(5):567-70). W6/32 is available 
biotinylated (Serotec MCA81B) offering additional variations in ELISA 
procedures. 

Anti-human b2m (HRP) (DAKO P0174) recognizes denatured as well as 
complexed b2m. Although in principle anti-b2m reagents could be used for the 
purpose of identification of HLA molecules, they are less suitable when 
association of heavy chain and b2m is weak. The patterns of class I molecules 
precipitated with W6/32 and anti-b2m are usually indistinguishable [Vasilov, 
1983 #10]. 

Rabbit anti-b2-microglobulin dissociates b2-microglobulin from heavy 
chain as a consequence of binding (Rogers, M.J., Appella, E., Pierotti, M. A., 
Invernizzi, G., and Parmiani, G. (1979) Proc Natl. Acad. Sci. U.S.A. 76, 1415- 
1419). It also has been reported that rabbit anti-human b2-microglobulin 
dissociates b2-microglobulin from HLA heavy chains upon binding (Nakamuro, 
K., Tanigaki, N., and Pressman, D. (1977) Immunology 32, 139-146.). This 
anti-human b2m antibody is also available unconjugated (DAKO A0072). 
The W6/32-HLA sandwich ELISA 

Sandwich assays can be used to study a number of aspects of protein 



-116- 



WO 02/062846 PCT/USO 1/49744 

complexes. If antibodies are available to different components of a 
heteropotymer, a two-antibody assay can be designed to test for the presence 
of the complex. Using a variation of these assays, monoclonal antibodies can 
be used to test whether a given antigen is multimeric. 

The W6/32 - anti-b2m antibody sandwich assay is one of the best 
techniques for determining the presence and quantity of sHLA. Two antibody 
sandwich assays are quick and accurate, and if a source of pure antigen is 
available, the assay can be used to determine the absolute amounts of antigen 
in unknown samples. The assay requires two antibodies that bind to non- 
overlapping epitopes on the antigen. This assay is particularly useful to study 
a number of aspects of protein complexes. 

To detect the antigen (sHLA), the wells of microtiter plates are coated 
with the specific (capture) antibody W6/32 followed by the incubation with test 
solutions containing antigen. Unbound antigen is washed out and a different 
antigen-specific antibody (anti-b2m) conjugated to HRP is added, followed by 
another incubation. Unbound conjugate is washed out and substrate is added. 
After another incubation, the degree of substrate hydrolysis is measured. The 
amount of substrate hydrolyzed is proportional to the amount of antigen in the 
test solution. 

The major advantages of this technique are that the antigen does not 
need to be purified prior to use and that the assays are very specific. The 
sensitivity of the assay depends on 4 factors: 

(1) The number of capture antibody 

(2) The avidity of the capture antibody for the antigen 

-117- 



WO 02/062846 



PCT/US01/49744 



(3) The avidity of the second antibody for the antigen 

(4) The specific activity of the labeled second antibody 

Use an ELISA protocol template and label a clear 96-well polystyrene assay 
plate. Polystyrene is normally used as a microtiter plate. (Because it is not 
translucent, enzyme assays that will be quantitated by a plate reader should be 
performed in polystyrene and not PVC plates). 



Company 


Plate 


Specificity 


Cat# 










Nunc 


Maxisorp 
StarWell Modules 
Framed 8-well 
strips 


standard/un 
treated 


441653 



Coating of the W6/32 should be performed in Tris buffered saline (TBS); 
pH 8.5. Prepare a coating solution of 8.0 pg/ml of specific W6/32 antibody in 
TBS (pH 8.5). Although this is well above the capacity of a microtiter plate, 
the binding will occur more rapidly. Higher concentrations will speed the binding 
of antigen to the polystyrene but the capacity of the plastic is only about 100 
ng/well (300 ng/cm 2 ), so the extra protein will not bind. If using W6/32 of 
unknown composition or concentration, first titrate the amount of standard 
antibody solution needed to coat the plate versus a fixed, high concentration 
of labeled antigen. Plot the values and select the lowest level that will yield a 
strong signal. Do not include sodium azide in any solutions when horseradish 
peroxidase is used for detection. 

Immediately coat the microtiter plate with 100 |J' per well using a 
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multichannel pipet. Standard polystyrene will bind antibodies or antigens when 
the proteins are simply incubated with the plastic. The bonds that hold the 
proteins are non-covalent, but the exact types of interactions are not known. 
Shake the plate to ensure that the antigen solution is evenly distributed over 
the bottom of each well. Seal the plate with plate sealers (sealplate adhesive 
sealing film, nonsterile, 100 per unit; Phenix (1-800 767-0665); LMT-Seal-EX) 
or sealing tape to Nunc-Immuno™ Modules (# 236366). Incubate at 4°C 
overnight. Avoid detergents and extraneous proteins. Next day, remove the 
contents of the well by flicking the liquid into the sink or a suitable waste 
container. Remove last traces of solution by inverting the plate and blotting it 
against clean paper toweling. Complete removal of liquid at each step is 
essential for good performance. Wash the plate 10 times with Wash Buffer 
(PBS containing 0.05 % Tween-20) using a multi-channel ELISA washer. 

After the last wash, remove any remaining Wash Buffer by inverting the 
plate and blotting it against clean paper toweling. After the W6/32 is bound, 
the remaining sites on the plate must be saturated by incubating with blocking 
buffer made of 3% BSA in PBS. Fill the wells with 200 |Jl blocking buffer. Cover 
the plates with an adhesive strip and incubate overnight at 4°C. Alternatively, 
incubate for at least 2 hours at room temperature which is, however, not the 
standard procedure. Blocked plates may be stored for at least 5 days at4°C. 
Good pipetting practice is most important to produce reliable quantitative 
results. The tips are just as important a part of the system as the pipette itself. 
If they are of inferior quality or do not fit exactly, even the best pipette cannot 
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produce satisfactory results. 

When maximum levels of accuracy are stipulated, prewetting should be 
used at all times. To do this, the required set volume is first drawn in one or 
two times using the same tip and then returned. Prewetting is absolutely 
necessary on the more difficult liquids such as 3% BSA. Do not prewet, if your 
intention is to mix your pipetted sample thoroughly with an already present 
solution. However, prewet only for volumes greater than 10 pi. In the case of 
pipettes for volumes less than 10 pi the residual liquid film is as a rule taken 
into account when designing and adjusting the instrument. The tips must be 
changed between each individual sample. With volumes <10 pi special 
attention must also be paid to drawing in the liquid slowly, otherwise the 
sample will be significantly warmed up by the frictionai heat generated. Then 
slowly withdraw the tip from the liquid, if necessary wiping off any drops 
clinging to the outside. To dispense the set volume hold the tip at a slight 
angle, press it down uniformly as far as the first stop. 

In order to reduce the effects of surface tension, the tip should be in 
contact with the side of the container when the liquid is dispensed. After liquid 
has been discharged with the metering stroke, a short pause is made to enable 
the liquid running down the inside of the tip to collect at its lower end. Then 
press it down swiftly to the second stop, in order to blow out the tip with the 
extended stroke with which the residual liquid can be blown out. In cases that 
are not problematic (e.g. aqueous solutions) this brings about a rapid and 
virtually complete discharge of the set volume. In more difficult cases, a slower 
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discharge and a longer pause before actuating the extended stroke can help. 
To determine the absolute amount of antigen (sHLA), sample values are 
compared with those obtained using known amounts of pure unlabeled antigen 
in a standard curve. 

For accurate quantitation, all samples have to be run in triplicate, and the 
standard antigen-dilution series should be included on each plate. Pipetting 
should be preformed without delay to minimize differences in time of incubation 
between samples. All dilutions should be done in blocking buffer. Thus, 
prepare a standard antigen-dilution series by successive dilutions of the 
homologous antigen stock in 3% BSA in PBS blocking buffer. In order to 
measure the amount of antigen in a test sample, the standard antigen-dilution 
series needs to span most of the dynamic range of binding. This range spans 
from 5 to 100 ng sHLA/mi. 

A stock solution of 1 \ig/m\ should be prepared, aliquoted in volumes of 
300 pi and stored at 4°C. Prepare a 50 ml batch of standard at the time. (New 
batches need to be compared to the old batch before used in quantitation). 
Use a tube of the standard stock solution to prepare successive dilutions 
according to the scheme below. While standard curves are necessary to 
accurately measure the amount of antigen in test samples, they are 
unnecessary for qualitative "yes/no" answers. 

For accurate quantitation, the test solutions containing sHLA should be 
assayed over a number of at least 4 dilutions to assure to be within the range 
of the standard curve. Prepare serial dilutions of each antigen test solution in 
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blocking buffer (3% BSA in PBS). Standard dilutions for purified, crude or flow 
through samples are given below: 

After mixing, prepare all dilutions in disposable U-bottom 96 well 
microtiter plates before adding them to the W6/32-coated plates with a 
multipipette. Add 150 pi in each well. To further proceed, remove any 
remaining blocking buffer and wash the plate as described above. The plates 
are now ready for sample addition. Add 100 |Jl of the sHLA containing test 
solutions and the standard antigen dilutions to the antibody-coated wells. 
Cover the plates with an adhesive strip and incubate for exactly 1 hour at room 
temperature. After incubation, remove the unbound antigen by washing the 
plate lOx with Wash Buffer (PBS containing 0.05 % Tween-20) as described. 

Prepare the appropriate developing reagent to detect sHLA. Use the 
second specific antibody, anti-human b2m-HRP (DAKO P0174 / 0.4 mg/ml) 
conjugated to Horseradish Peroxidase (HRP). Dilute the anti-human b2m-HRP 
in a ratio of lrl'OOO in 3% BSA in PBS. (Do not include sodium azide in 
solutions when horseradish peroxidase is used for detection). 



No. of Total anti-b2m-HRP 3%BSA 

plates Antibody in PBS 

Mix: 1 10 ml 10 pi 10 ml 

2 20 ml 20 |Jl 20 ml 

3 30 ml 30 |Jl 30 ml 

4 40 ml 40 |Jl 40 ml 

5 50 ml 50 Ml 50 ml 



Add 100 M< of the secondary antibody dilution to each well. All dilutions 
should be done in blocking buffer. Cover with a new adhesive strip and 
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incubate for 20 minutes at room temperature. Prepare the appropriate amount 
of substrate prior to the wash step. Bring the substrate to room temperature. 
OPD (o-Phenylenediamine) is a peroxidase substrate suitable for use in ELISA 
procedures. The substrate produces a soluble end product that is yellow in 
color. The OPD reaction is stopped with 3 N H 2 S0 4/ producing an orange-brown 
product and read at 492 nm. Prepare OPD fresh from tablets (Sigma, P6787; 
2 mg/tablet). The solid tablets are convenient to use when small quantities of 
the substrate are required. 

After second antibody incubation, remove the unbound secondary reagent 
by washing the plate lOx with Wash Buffer (PBS containing 0.05% Tween-20). 
After the final wash, add 100 pi of the OPD substrate solution to each well and 
allow to develop at room temperature for 10 minutes. Reagents of the 
developing system are light-sensitive, thus, avoid placing the plate in direct 
light. Prepare the 3 N H 2 S0 4 stop solution. After 10 minutes, add 100 |Ji of 
stop solution per 100 |Jl of reaction mixture to each well. Gently tap the plate 
to ensure thorough mixing. Read the ELISA plate at a wavelength of 490 nm 
within a time period of 1.5 minutes after stopping the reaction. 

The background should be around 0.1. If the background is higher, the 
substrate may have been contaminated with a peroxidase. If the subtrate 
background is low and the background in your assay is high, this may be due 
to insufficient blocking. Finally analyze your readings. Prepare a standard 
curve constructed from the data produced by serial dilutions of the standard 
antigen. To determine the absolute amount of antigen, compare these values 
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with those obtained from the standard curve. 

After running the ELISA on various transfectants it is possible to determine 
which ones will produce the highest levels of soluble HLA molecules. These 
"good producers" are grown to the point where a cell pharm can be seeded. 
The harvest from the cell pharm is then used to extract the soluble HLA using 
the following purification protocol. 

To start the chromatography procedure, prepare the AKTA™ prime 
system. The system can be used immediately but the spectrophotometers full 
ability will not be obtained until after 1 hour of lamp warm-up. To prepare the 
system for a run, check that the buffer inlet tubings are immersed in the 
correct buffer vessels and the waste tubings are put into a waste bottle. Only 
use degassed and filtered liquids to make sure that the liquid remains free from 
air bubbles. Degass by applying a vacuum to the solution. Prepare and hook 
up the buffers necessary for a sHLA purification: 
Buffer Valve (A) 



1. 


PBS, pH 7.4 


(Wash buffer) 


2. 


20 % Ethanol/ 70 % Ethanol 


(Cleaning solutions) 


3. 


0.1 N NaO 


(MOK elution buffer) 


4. 


50 mM Diethylamine (DEA), pH 11.3 


(MOK elution buffer) 


5. 


Protein sample 


(The line is stored in 






PBS/0.05% Na Azide, pH 7.4) 


6. 


0.2 N Acetic acid, pH ~2.7 


(Cleaning & MOK solution) 


7. 


0.1 M Citric acid, pH 3.0 


(Protein A elution buffer) 


8. 


0.1 M Glycine, pH 11.0 


(sHLA elution buffer) 



It is important to purge the lines after a new hook-up with about 50 ml of 
liquid to get the air out of the system. Purging can be done manually through 
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the inlets of the buffer valve (A1-A8), while carefully immersing the tubing in 
the respective liquid. To remove any trapped air bubbles in the flow path, 
purge the pump in the order PBS / 20 % ethanol / PBS / final buffer solution. 
Next, prepare the recorder to monitor the purification. Autozero the built-in UV 
spectrophotometer with PBS as reference. Equilibrate all material to the 
temperature at which the chromatography will be performed. For large scale 
purifications, attach the column entrance/exit to the system. 

Equilibrate the column by passing 10 bed-volumes of PBS over the matrix. 
Before starting any column purification, the protein concentration in the sample 
solution should be determined using a quantitative ELISA procedure. The 
sample volume loaded will depend on the size and loading capacity of the 
column and the concentration of the sample. Calculate the volume of the 
sample solution maximally saturating the column according to the columns 
capacity to bind the antigen. 

(A) Antigen concentration: mg/ml antigen 

(B) Antigen binding capacity: mg antigen/ml gel 

(C) Matrix volume: ml gel 

(D) Maximal amount of antigen: (B*C) mg 

(E) Sample volume: (D/A) ml 

Since the binding capacity of the column will realistically not be reached, 
a much lower volume of sample solution should be chosen. A value between 40 
to 50% of the calculated volume is more accurate which also will not result in 
the waste of lots of unbound antibody within the flow-through. Prepare the 
antibody sample solution for purification. Spin crude harvest at 5*000 rpm for 
25 minutes (JA10 rotor) to remove lipid and cell debris. The antigen solution 
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must be free of particulate matter. Pour the supernatant into a suitable 

container. Prevent air bubble formation. 

Name of the crude harvest: 
Volume used: ml 
Amount of sample: mg 

The simplest method to bind the antigen to the antibody/Sepharose 4B 
matrix is to apply the sample through the system pump and pass the protein 
solution down the column. Set appropriate parameters to record the loading 
conditions on the recorder. 





Chart Speed 


Conductivity 


Optical Density 










Load 


0.1 mm/min 


0.5 V 


1.0V 











Save a 1 ml probe from the starting material (LOAD) before the 
purification procedure for analysis purposes. Set the buffer valve to position 
5 and the injection valve to position LOAD. Make sure the inlet tubings are 
purged with sample buffer without any air-bubbles present. To have a purged 
sample line, disconnect shortly the column before loading and circulate the 
sample within the system with higher flow rate. Pass the solution slowly 
through the column with a flow rate of approximately 1.0 ml/min or lower to 
give the protein time to bind more efficiently. Higher flow rates will decrease 
efficiency. A disruption in flow may cause a rapid rise in back-pressure. If this 
occurs, immediately shut off the pump and check the gel bed for compression. 

Collect the flow-through in an appropriate container. Keep until you are 
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sure all material has bound to the column and negligible amounts are In the 
flow through. Take a sample at the end of the run (Ft) which should be 
analyzed- Wash the column with PBS at 10 ml/min until UV absorbance at 280 
nm is zero. For a large column use 2000-3000 ml wash buffer (PBS). Save the 
wash in a container until after the purification- 





Chart Speed 


Conductivity 


Optical 
Density 










Wash 


0.5 mm/min 


0.5 V 


1.0 V 











Prepare borosilicate collection tubes by adding 1.2 ml of 1 M Tris-HCI, pH 
7.0 per 4.8 ml of fraction to be collected (1:4). Neutralization is a safety 
measure to preserve the activity of the eluted molecule. MHC class I (sHLA) 
molecules are best eluted from a W6/32 column by 0.1 M glycine, pH 11.0. 
Absorbance is used for generating a protein eiution profile. 





Chart Speed 


Conductivity 


Optical Density 










Blution 


0.5 mm/sec 


0.2 V 


0.1 V 











Place the collector arm over the first collection tube. Elute 4.8 ml per 
fraction at 10 ml/min. Immediately afterwards, mix each tube gently to bring 
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the pH back to neutral. As with all protein solutions, avoid bubbling or frothing 
as this denatures the proteins. If a very low amount of protein is expected, 
change the conductivity on the recorder to a lower value. Identify the antigen- 
containing fractions by absorbance at 280 nm on the chart and combine them 
during up-concentration. Up-concentrate immediately and buffer exchange into 
PBS using MACROSEP™ centrifugal concentrators (Pall Filtron; Northborough, 
MA; MACROSEP 10K; OD010C37). Keep the protein on ice at all times and 
centrifuge at 4°C. After the buffer exchange, prepare the sample for storage 
at 4°C. Filter the pure samples through a 0.2 |J filter and aliquot directly into 
sterile, screw cap tubes. Label appropriately. 

Determine the absorbance at 280 nm as well as the protein concentration 
with the Micro BCA kit. Activity can be determined with a regular ELISA 
procedure. The purity of the eluted sHLA can be assessed by SDS-PAGE, 
Western blotting or performing a Superdex column analysis. After the elution, 
quickly re-equilibrate the column with PBS to avoid denaturation of the W6/32 
antibody linked to it. For analytical work in which more than one allele will be 
purified on the same column, extreme care must be taken. To be able to re-use 
the column, start a maintenance procedure after the re-equilibration. Cleaning- 
in-place is a procedure, which removes contaminants such as lipids, precipitates 
or denatured proteins that may remain in the column after regeneration. Such 
contaminations are especially likely when working with crude materials. The 
procedure helps to maintain the capacity, flow properties and general 
performance. 
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Mock elute the column using buffers with alternating pH. Start running 
over 10 gel volumes of 0.2 N acetic acid followed by 10 gel volumes of 50 mM 
diethylamine, pH 11.3 at a speed of 10 ml/min. Repeat three times and always 
equilibrate with 10 gel volumes PBS between buffer changes. 





Chart Speed 


Conductivity 


Optical 
Density 










Mok-elution 


1.0 mm/min 


0.2 V 


0.1 V 











After Mok-elution, store the column at 4°C in PBS/0.05% Na Azide. 
Sanitization inactivates microbial contaminants in the packed column and 
related equipment. One generally recommended procedure is to wash 
alternately with high and low pH buffers as performed in the coupling reaction. 
For. sanitization, disassemble the column and wash the matrix alternately with 
low pH wash buffer (0.1 M sodium acetate containing 0.5 M NaCI, pH 4.0) and 
high pH wash buffer (0.1 M Tris-HCl containing 0.5 M NaCI, pH 8.0) for 3 times 
followed by re-equilibration with PBS. Re-assemble the cleaned and sterilized 
column and store it at 4°C in PBS containing 0.05% sodium azide. 

After the column is removed, the AKTA™ prime system has to be cleaned 
carefully. Start with the cleaning of line 5, where the sample was hooked up. 
Rinse the system pump and include the fraction collector line. First clean the 
inlet tubings, by manually running the system pump and flushing with 0.2 N 
acetic acid at 30 ml/min followed by 0.1 N NaOH. Always equilibrate with PBS. 
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Don't forget to add a line between the injection valve and the UV detector as 
a bridge, as replacement of the column. Finally, rinse with 20% ethanol. If the 
column was sanitized because of bacterial contamination, rinse with 70% 
ethanol. 

In order to test whether or not the sHLA molecules produced were 
conformational, an assay was developed using a variety of different antibodies: 

1. w6/32, recognizes conformational intact trimer 

2. hclO, recognizes denatured HLA molecules 

3. anti b 2 M, recognizes free and non-covalently associated Beta 2 
microglobulin 

The methodology for this assay is essentially the same as the standard 
Elisa described above, except that instead of coating the plate with w6/32, the 
soluble HLA molecules are coated directly to it. The three different antibodies 
are then utilized in three different ways to detect the bound HLA. W6/32 is 
biotin labeled, therefore Vectastain kit and OPD are used for detection, anti b 2 M 
is conjugated to horse radish peroxidase and so can be directly detected using 
OPD and with hclO a secondary anti-mouse IgG horse radish peroxidase 
conjugated antibody and OPD are used. 

Results for sHLA Production Using qDNA as the Starting Material 
Using sample 3A394 

Genomic PIMA extraction. 

Using 200ul whole blood, a Qiagen DNA extraction was performed. Table 
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19 shows the optical density readings and concentration of this extraction. 



Primary PCR of 3A394 

lOx pfu buffer 5ul 
PP5UTA lul 
3PPI4A lul 
dNTP lul 
gDNA lOul 
H 2 0 31.4ul 
Pfu lul 



See FIG. 30 for the gel image of this PCR. 

FIG 30. 2% agarose gel showing the primary PCR product of 3A394 (5 th lane 
from the right) at approximately 2kb in size. 

The PCR product was gel purified and this along with unpurified PCR was used 

in the secondary PCR at a 1:100 dilution. 

Secondary PCR of 3A394 

lOx pfu buffer 5ul 
dNTP lul 
PP5UTA lul 
PP3PEI lul 
1:100 1°PCR lOul 
H 2 0 31.4ul 
Pfu 0.6ul 

The results of this can be seen in FIG. 31. Secondary PCR of 3A394. Gel 
purified and unpurified can be seen in lanes 2 and 3 of. FIG. 31. 

The PCR product derived from gel purified primary PCR was in turn gel 
purified and used in the restriction digest in order to ligate into pcDNA 3.1(-)/ 
FIG. 32. 
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FIG. 32 is a gel showing 3A394 and pcDNA3.1 digested with EcoR I and 
Xba I. Once the ligations, at three different insert to vector ratios, and 
transformation were completed colonies were picked, grown overnight and then 
the plasmid DNA was extracted. A restriction digest was performed to screen 
for insert as shown in FIG. 33 is a restriction digests of 3A394 clones. A vector 
to insert ratio of 1:6 is the most efficient. 

Following identification of positive clones the concentration of the vector 
was established by optical density (Table 20 ) and then diluted to 500ng/ul. 

Cycle sequencing was then performed on sample 3A394TPC1 and it was 
established that this was a good clone of HLA-A*1102 truncated at codon 298. 

The clone 3A394PC1 was then grown up and the plasmid containing 
A*1102 Truncated (A*1102T) was prepared for electroporation by performing 
a large scale plasmid extraction. The concentration of this was determined 
using optical density which can be seen in Table 21. 

30ug of this was then electroporated into the cell line LCL 721.221 with 
the decay times being recorded then, after 2 days the viability of the cells was 
determined, Table 22. 

The cells were then placed under G418 selection to screen for positive 
v transfectants, those exhibiting G418 resistance were screened for soluble HLA 
production using the Elisa, Table 23. Only two replicates were tested for this 
sample and all were done using undiluted supernatant. 

Samples 3A394TPC1 wells 1 and 2 were grown further and eventually well 
1 cells were seeded into a cell pharm. This protocol has been performed on 
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many HLA types as can be seen by the production of soluble HLA molecules in 
Table 24. 



Due to the fact that the monoclonal antibody w6/32 only recognizes the 
folded trimer complex of heavy chain-light chain-peptide the results of the 
ELISA demonstrate that we are producing conformational molecules originally 
derived from genomic DNA. 

FIG. 34 is a graph showing the comparative binding of three monoclonal 
antibodies to four different soluble HLA molecules. A*1102T is sample 
3A394TRClwell 1 from genomic DNA the other three are produced from cDNA. 
Three different amounts of soluble HLA were coated to the plate for each allele. 

The results of the comparative binding assay demonstrate several 
properties of the soluble HLA produced from genomic DNA. Coating a plate with 
more protein will not necessarily yield a higher signal to protein ratio. Soluble 
HLA from genomic DNA gives results comparable to that of cDNA constructs. 
The fact that all three antibodies bind this confirms the correct epitopes of the 
recombinant molecules are present. 

Due to the polymorphic nature of the HLA system production of many 
alleles as soluble molecules is very difficult as a viable cell line is required in 
order to make cDNA and quite often this is not available. Thus a method that 
allows us to produce many different alleles from a readily available starting 
point is Invaluable. Production of the sHLA from genomic DNA provides such 
a starting point. We have shown here that two simple PCR reactions allows us 
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to clone many, if not all, HLA Class I alleles from genomic DNA. 

The Elisa data allows us to test test how functional these molecule are. By 
using w6/32 and anti b 2 M to establish production levels we also provide 
information as to how much of the protein is in a trimeric form. The 
comparative Elisa data helps back this up as the ratio of w6/32:hcl0 needs to 
be greater than 1.0 in order for there to be more conformational molecule than 
denatured, this is shown to be the case. In summary we have developed a 
technique that will allow the production of virtually any HLA Class I molecule in 
a soluble form on demand. 

An exemplary useful product which can be obtained from the. mammalian 
cell line expressing such a genomic DNA construct is a cDNA clone encoding the 
desired class I or class II molecule. The cDNA clone encoding the desired class 
I or class II molecule is formed from the mRNA molecule encoding the desired 
class I molecule isolated from such mammalian cell line. The cDNA clone may 
be utilized for functional testing, as described in more detail herein below. 
Thus, gDNA clones can be used as a mechanism to Obtain cDNA clones of the 
desired class I or class II HLA molecule. 

The cDIMA clones may be transfected into a cell which is unable to splice 
introns and process the mRNA molecule and therefore would not express the 
MHC molecule encoded by the genomic DNA, such as insect cells or bacterial 
cells. In addition, these cell lines will also be deficient in peptide processing and 
loading, and therefore the soluble MHC molecules expressed from such cells will 
not contain peptides bound therein (referred to as free heavy chain HLA). Such 
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soluble, free heavy chain HLA can effectively be tested for epitope binding as 
well. That is, MHC made in cells which do not naturally load peptide can be 
experimentally loaded with the peptide of choice. The heavy chain, light chain, 
peptide trimer can be reassembled in vitro using a high affinity peptide to 
facilitate assembly. Alternatively, a cell deficient in peptide processing can be 
pulsed with peptide such that the trimolecular MHC complex forms. DNA 
encoding a peptide (also encoding an appropriate targeting signal) could also 
be co-transfected into the cell with the MHC so that the MHC molecule which 
emerges from the cell is loaded only with the desired peptide. In this way MHC 
molecules could be loaded with a single low affinity peptide so that replacement 
with test peptides in a binding assay are more controlled. 

Note that an advantage of secreting individual MHC molecules from a cell 
that naturally loads peptide is that the MHC molecule of interest is naturally 
loaded with thousands of different peptides. When used in a peptide binding 
assay, a synthetic peptide can therefore be compared to thousands of naturally 
loaded peptides. 

Another use for the sHLA produced functional testing, according to the 
methodology of the present invention is the peptide- MHC complex can be 
multimerized to form soluble peptide-MHC dimers or tetramers, or other 
multiple soluble peptide-MHC - mers, such as fivemers, sixmers, etc. which 
serve as ligands for CTLs. The tetramers can be mixed with CTLs in vitro or 
with CTLs from the blood of human subjects to identify antigenic peptides 
responsible for immune responses in humans. Altman et al (Science, 1996), 
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herein expressly incorporated by reference in its entirety, discloses a method 
of functional testing using tetramer technology; however, the method of 
Altman, however, only discloses one soluble MHC molecule which has been 
utilized in such a method, and Altaian's method faces the same disadvantages 
and defects described above for the prior art, that is, the method envisions 
isolating individual mRNA/cDNA molecules from hundreds of different, typed cell 
lines, and then manipulating the cDIMA molecules to produce the desired soluble 
MHC molecule. The methods of the present invention envision combining the 
tetramer technology with amplification of genomic DNA, cloning the genomic 
DNA fragment, and transfection of the resulting construct into a mammalian cell 
line followed by isolation of cDNA from such transfected cell line and 
transfection into a cell line deficient in peptide processing and loading, thereby 
removing the need to isolate hundreds of different, typed cell lines for obtaining 
the different cDNAs. 

MHC/peptide tetramers are widely utilized in the phenotypic analysis of T 

cells and in the study of T cell responses to pathological conditions such as viral 
infections and cancer. Current methodology for tetramer production consists of 
expressing the MHC class I heavy chain in bacterial or insect cells and refolding 
the heavy chain in the presence of p-2-microglobulin and a specific peptide 
ligand in vitro. 

The methodology of the present invention had two specific aims, although 
this should not be regarded as limiting: 1) to engineer a cDNA construct of a 
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class I heavy chain containing a BirA substrate peptide (bsp) sequence at its 3' 
end (C-terminus) which would enable its subsequent biotinylation and 2) to 
develop a novel means of tetramer production using a mammalian expression 
system. The mammalian system used was a B cell/T cell hybrid, the antigen- 
processing mutant cell line CEM x 721.174.T2 (T2). When pulsed with 
established HLA B*0702-presented peptides from HIV-infected CD4+ T cells, 
T2 cells transfected with the recombinant, truncated B*0702 HLA heavy chain 
secreted specific MHC/peptide complexes. Following enzymatic biotinylation, 
these complexes were combined with avidin to form B7 tetramers. A 
mammalian expression system affords several advantages over a prokaryotic 
system, such as allowing normal glycosylation of the class I heavy chain and 
eliminating the need to refold the MHC/peptide complex in vitro following 
expression. The MHC class I molecules are therefore naturally folded in the cell 
rather than artificially folded outside the cell. Indeed, such artificial folding of 
the MHC class I molecules outside of the cell results in a MHC class I molecule 
which differs from one found in a "natural" system (i.e. a human immune 
system) and as such, is not an appropriate basis upon which to conduct vaccine 
development. 

Abbreviations: MHC, major histocompatibility complex; p 2 m, p 2 _ 
microglobulin; HLA, human leukocyte antigen; sHLA, soluble human leukocyte 
antigen; bsp, BirA substrate peptide or biotinylation substrate peptide; CTL, 
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cytotoxic T lymphocytes; PCR, polymerase chain reaction; ELISA, enzyme- 
linked immunosorbent assay; HRP, horseradish peroxidase; PE, phycoerythrin; 
PBS, phosphate buffered saline; TAP, Transporters associated with Antigen 
Processing; TCR, T cell receptor; ER, endoplasmic reticulum; kDa, kiloDaltons. 

The structure and function of MHC class I. Unlike B lymphocytes which 
can interact with antigen in its intact form, T lymphocytes only recognize 
protein antigen broken down into peptide fragments and presented in 
association with specialized cell-surface molecules. These molecules are the 
gene products of the major histocompatibility complex (MHC) (FIGS. 1 and 35), 
a region on chromosome six known to encode proteins that are critical for 
immunologic specificity and transplantation histocompatitlblity. MHC molecules, 
also termed human leukocyte antigens (HLA), are cell-surface glycoproteins 
that function primarily in communicating to T cells the presence of intracellular 
or extracellular invaders. MHC/peptide complexes are recognized by the T cell 
receptor (TCR), an interaction enabling T cells to discriminate between MHC 
molecules bearing "foreign" antigen (viral/tumor/bacterial peptides) and MHC 
molecules displaying "self" peptides. An immune response can then be initiated 
against cells determined to harbor foreign antigen on the basis of the avidity of 
this binding interaction. 

MHC molecules are of two types, designated class I and class II. In 
general, class II molecules are located on antigen presenting cells (APCs), such 
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as macrophages or dendritic ceils, and present peptides from exogenously 
synthesized protein, usually the breakdown products from phagocytosed 
bacteria or protozoa. Thus, class II molecules primarily function to warn the 
immune system of extracellular invaders. The class II/peptide complex is 
recognized by and interacts with TCRs of CD4+ T cells (T helper or Th cells). 
Class I molecules are located on platelets and all nucleated cells of the body 
and bind peptides derived from endogenously synthesized proteins. Examples 
of endogenously synthesized proteins which may be broken up and presented 
by class I molecules include both "self" proteins, such as those normally 
produced in healthy cells, and "foreign" proteins, such as the viral proteins in 
virus-infected cells or the abnormal proteins produced in tumor cells. Thus, 
class I molecules primarily function to report to the immune system the health 
of the intracellular environment. The TCRs of CD8+ cytotoxic T lymphocytes 
(CTLs) interact with the MHC class I/peptide complex. 

Structurally, class I molecules are heterodimers comprised of two 
noncovalently bound polypeptide chains, a larger "heavy" chain (a) and a 
smaller "light" chain (P-2-microgIobulin, or P 2 m). The polymorphic, polygenic 
heavy chain (45 kDa), encoded within the MHC on chromosome six, is 
subdivided into three extracellular domains (designated cc 17 oc 2/ and cx 3 ), one 
intracellular domain, and one transmembrane domain (FIG. 36). The two 
outermost extracellular domains, a x and cc 2/ together form the groove that 
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binds antigenic peptide (FIG. 37). Thus, interaction with the TCR occurs at this 
region of the protein. The a 3 domain of the molecule contains the recognition 
site for the CD8 protein on the CTL; this interaction serves to stabilize the 
contact between the T cell and the APC. 

The invariant light chain (12 kDa), encoded outside the MHC on 
chromosome 15, consists of a single, extracellular polypeptide. In the 
recombinant truncated HLA class I molecule (soluble HLA or sHLA, 
approximately 47 kDa), the C-terminal intracellular and transmembrane 
domains of the heavy chain are not amplified by PCR and thus deleted, resulting 
in a functional MHC/peptide complex that is secreted from the cell and still 
capable of interaction with the TCR. 

Antigen processing and assembly of the MHC class I/peptide complex. The 
heavy chain of the class I heterodimer is cotranslationally inserted into the 
lumen of the ER (FIG. 38). The extracellular (intralumenal) domains of newly 
synthesized a chains are glycosylated and become immediately associated with 
the ER chaperone protein calnexin. Calnexin is a membrane-bound molecule 
that functions to temporarily keep the heavy chain in a partially folded state. 
The subsequent noncovalent interaction of free 3 2 m with the heavy chain 
causes the release of calnexin, and the heavy chain/|3 2 m complex becomes 
sequentially associated with the chaperones calreticulin and tapasin. These 
chaperones position the class I heterodimer in a manner that enables it to be 
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loaded with the processed peptides once they are transported into the ER by 
the TAP (transporter associated with antigen processing) complex, located 
within the ER membrane. 

In the cytosol, a multisubunit, multicatalytic protease complex called the 
proteosome functions to prepare the endogenously synthesized proteins for 
presentation by MHC class I molecules. Once proteins (viral or otherwise) are 
cleaved into peptides of various lengths by the proteosome, the TAP complex 
actively transports the peptides into the ER where the MHC heterodimer awaits. 
In the ER lumen, peptides are further trimmed to lengths of 8-11 amino acids. 
If possessing the requisite binding affinity to the particular class I allele, a 
peptide will be loaded into the binding cleft of the class I heavy chain to form 
the MHC heavy chain/light chain/peptide heterotrimeric complex, which is 
subsequently routed to the cell surface via the Golgi apparatus (FIG. 39). 

Cells incapable of antigen processing, for reasons such as inhibition of the 
proteosome or mutation of TAP, will likewise be deficient in the cell-surface 
expression of the class I/peptide complex since the class I heterodimer cannot 
leave the ER until it binds peptide. A particular cell line called T2, which is TAP- 
deficient and therefore has no cell-surface class I expression, is widely utilized 
in antigen processing studies. Additionally, it has been shown that abundant 
exogenous peptide can be pinocytosed, enter an intracellular trafficking 
pathway, and be delivered to the ER compartment directly in a TAP- 
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independent way. High affinity peptides delivered to the ER in this manner can 
then bind to de novo-synthesized MHC class I molecules and be subsequently 
displayed at the cell surface. Thus, T2 cells, the system of mammalian 
expression utilized in the experiments herein, are capable of cell surface MHC 
class I expression when pulsed with exogenous peptide of sufficient affinity for 
the particular class I allele. 

The interaction of the MHC/peptide complex with the TCR is the central 
event in the initiation of most antigen-specific immune responses. The TCR 
recognizes the antigen complex by forming intermolecular contacts with both 
the class I molecule and the antigenic peptide. The outcome of this interaction 
(i.e. whether or not an immune response is generated) is dependent upon the 
density and duration of the TCR-MHC/peptide binding. Thus, an immunogenic 
MHC/peptide complex will bind the TCR with greater avidity than a non- 
immunogenic MHC antigen complex. Furthermore, it is known that a 
monomeric MHC/peptide complex, even if immunogenic, dissociates rapidly 
from a TCR, indicating that multiple TCRs must interact with multiple 
MHC/peptide complexes in order to activate T cells. In light of this information, 
Altman et al. disclosed a suggested methodology to analyze antigen-specific T 
cell populations by multimerizing the MHC/peptide complex into tetramers. 
Essentially, a tetramer consists of four biotinylated MHC/peptide complexes 
non-covalently bound to an avidin molecule (FIG. 40). Compared to a 
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monomeric MHC/peptide complex, tetramers have slower rates of dissociation 
from CTLs since they are able to bind more than one TCR on that particular 
CTL This unique characteristic makes tetramers very useful as immunological 
stains. 

Until now, tetramers have incorporated MHC/peptide complexes that were 
made by expressing a recombinant class I heavy chain in bacteria and 
subsequently refolding the heavy chain in vitro in the presence of p 2 m and a 
specific peptide ligand. The purpose of this project was to make tetramers 
using MHC/peptide complexes synthesized in mammalian cells, specifically cells 
of the antigen-processing mutant cell line T2 so that every MHC/peptide 
complex secreted would contain the same antigenic peptide. It is desirable to 
reflect as much as possible the actual in vivo interaction between human cells 
expressing cell-surface MHC/peptide complexes and cytotoxic T lymphocytes. 

The three following oligonucleotide primers, designated A, B, and C, were 

purchased from commercially available sources (i.e. operon): 

Primer A, a 5' primer: 
5' GGG CCTCGAG GGACTCAGAATCTCCCCAGACGCCGAG 3' 

Primer B, a 3' primer: 

5' CCG CAAGCTT CCATCTCAGGGTGAGGGGCT 3 ' 

Primer C, a 3' primer: • 

5' CCG CGAATTCT TATTCGTGCCATTCGATTTTCTG 3' 

Primers A and B were used in PCR #1 and the template cDNA was a 
recombinant sHLA-B*0702 truncated gene with a 6-histidine tail in pcDNA3.1(- 
), a mammalian expression vector. PCR #1 was designed to incorporate 5'Xho 
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I (CTC GAG) and 3' Hind III (AAG CTT) cut sites and to amplify the truncated 
B*0702 heavy chain lacking a 3' stop codon. All PCR reactions for this project 
were performed with a proofreading taq polymerase (PFU Polymerase, 
Promega) and were subjected to 25 cycles of 95°C for 1 minute (strand 
separation), 59°C for 1 minute (primer annealing), and 72°C for 2 minutes 
(DNA synthesis), followed by a final extension time of 7 minutes at 72°C. The 
product of PCR #1 was purified using a QIAquick PCR purification kit (QIAGEN) 
and was subsequently digested for 2 hr at 37°C with Xho I and Hind III. 
Concurrently, a bacterial expression vector (C-Terminal Biotin AviTag Vector, 
Avidity; catalog number pAC-6) (FIG. 41) encoding the biotinylation substrate 
peptide sequence GLNDIFEAQKIEWHE was digested with the same restriction 
enzymes under the same conditions. The digest products were gel purified 
using a Freeze N Squeeze kit (BioRad) and ligated together for 2.5 hours at 
room temperature. Competent cells of E. coli strain JMlo9 were transformed 
with the ligated DNA, plated on LB/ampicillin agar, and incubated for 16 hours 
at 37°C. Using colony PCR, colonies were screened for insert of the gene into 
the vector. 

The B*0702t-no stop/AviTag vector DNA was isolated and prepared using 
a DNA Miniprep kit (Promega) and served as the template cDNA for PCR #2. 
Primers A and C were used in PCR #2, which was designed to maintain a 5' Xho 
I cut site, incorporate a 3' EcoR I cut site (GAA TTC) distal to the bsp, and 
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amplify the B*0702t gene with the bsp on its 3' end. The PCR product was 
purified with the QIAquick purification kit and, along with the mammalian 
expression vector pcDNA3.1(-), was digested with Xho I and EcoR I restriction 
enzymes. Digest products were gel purified (Freeze N Squeeze), ligated 
together, and transformed into competent JMlo9 E. colt cells, which were then 
incubated overnight at 37°C on LB/ampicillin agar. Colony PCR was performed 
to check selected colonies for insert of the B*0702t-bsp gene into pcDNA(-). 
DNA from clones with insert was prepared (Miniprep kit) and sequenced using 
cycle sequencing. Sequences were analyzed and a good clone was identified. 
The plasmid DNA of the good clone was isolated and prepared for transfection 
using a DNA Midiprep kit (QIAGEN). 

Cells of the human cell line T2 were cultured in RPMI 1640 media + 20% 
fetal calf serum, 1% penicillin/streptomycin, and 0.25% phenol red. A total of 
1.7 X 10 7 T2 cells were transfected with 30 [lg of B*0702t-bsp/pcDNA3.1(-) 
DNA by electroporation using the Gene Pulser (BioRad) at 0.25 V and 960 [iFD. 
Transfected cells were selected in a medium containing 40% RPMI 1640, 40% 
conditioned media, 20% fetal calf serum, 2% penicillin/streptomycin, 0.2% 
phenol red, and 1.5 mg/mL G418 neomycin (Cellgro). Surviving cells were 
pulsed with the synthetic HIV GAG peptide NH 2 -S-P-R-T-L-N-A-W-V-COOH at 
20 ug/mL and then incubated for 24 hours at 37°C. Transfectants were then 
screened by EUSA using W6/32 (8 \ig/mL), which is directed against the entire 
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HLA heavy chain/light chain (p 2 m)/peptide complex, as the primary (capture) 
antibody and anti-(3 2 m conjugated to HRP (diluted 1:1000) as the secondary 
(conjugate) antibody (Dako). 

High-producing cells were then expanded and cultured (continuously being 
repulsed with peptide), and their supernatant was collected and centrifuged to 
remove cell debris. The specific MHC/peptide complexes were purified using an 
affinity chromatography system (AKTA™ prime). In brief, the supernatant was 
passed over a 38 mL bed volume XK 26 column of cyanogen bromide-activated 
Sepharose 4B (Amersham Pharmacia Biotech) coupled to W6/32. The protein 
was then eluted intact in basic buffer (0.1 M glycine-NaOH, pH 11.0). To 
neutralize the eluate, 1.0 M Tris was added in a 1 to 4 ratio of neutralization 
buffer: elution buffer, resulting in a final solution of 0.25 M tris/0.075 M glycine, 
pH ~8.0. Immediately, the eluted protein was concentrated and buffer- 
exchanged to PBS + 0.05% Na Azide using MACROSEP centrifugal 
concentrators (Pall Filtron, MACROSEP 10K). Purified protein was quantified 
using ELISA. 

The purified sHLA-bsp/peptide complex was enzymatically biotinylated on 
a single lysine residue within the bsp by incubation with BirA (Avidity; other 
names for this enzyme include biotin protein ligase, biotin ligase, biotin operon 
repressor protein, biotin holoenzyme synthetase, and biotin-[acetyl-CoA 
carboxylase] synthetase) (FIG. 42). 
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Incubation time was varied under constant conditions to determine the 
time required for maximum biotinylation efficiency. Four identical reactions 
were incubated for either 1, 4, 8, or 16 hours at 37°C. Biotinylation mixture 
components were as follows: sHLA-bsp, 6 |JM; MgOAc (Biomix B, Avidity), 1.9 
mM; adenosine triphosphate (Biomix B, Avidity), 1.9 mM; BirA, 0.8 JJM; Tris- 
HCI, pH 8.0, 9.4 mM; biotin (Biomix B, Avidity), 10 |JM; Pefabloc SC Plus 
(Roche), a protease inhibitor, 0.42 mM. Biotinylation was confirmed using a 
modified ELISA, with W6/32 (8 ug/ mL ) as tne capture antibody and ABC 
Vectastain (Vector Laboratories), a kit containing avidin and biotinylated HRP, 
as the conjugate (FIG. 43). 

Biotinylated sHLA was separated from free biotin by applying the 
biotinylation reaction product to a Bio-Spin chromatography column (BioRad, 
Bio-Spin 30 Tris columns). Tetrameric sHLA/peptide complexes were produced 
by the stepwise addition (1/10* volume, waiting 10 minutes between each 
addition) of the conjugate UltraAvidin-R-phycoerythrin (UltraAvidin-R-PE; Leinco 
Technologies) to the biotinylated class I complex. The final mixture contained 
a 1:4 molar ratio of UltraAvidin-R-PE: biotinylated class I, which is the same as 
a 1:1 molar ratio UltraAvidin-R-PE biotin binding sites: biotinylated class I. The 
volumes of avidin and biotinylated class I were determined by calculating the 
molar concentrations of each (in moles/uL) and then making certain that the 
number of moles of biotin binding sites in the avidin equaled the number of 
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moles of biotinylated class I. (Because the biotinylation assay was qualitative 
rather than quantitative, the assumption that biotinylation efficiency was high 
(~90%) was made prior to making these calculations.) Tetramers were purified 
by gel filtration on a Superdex S-200 column (Pharmacia; Molecular Biology 
Resource Facility, University of Oklahoma Health Sciences Center). 

The construct successfully created using the AviTag bsp vector contained 
a 7-residue sequence (W-K-L-P-A-G-G) between the truncated B7 heavy chain 
at its C-terminus and the bsp. Despite the fact that the peptide binding groove 
is at the N-terminus of the heavy chain, it was undetermined at the time of 
transfection whether this 7-residue linker would in any way affect MHC protein 
folding or peptide binding capability. Because class I molecules must be 
properly folded and loaded with peptide before being directed to the cell 
surface, only the specific MHC/peptide complexes should be in the supernatant. 
Additionally, the W6/32 monoclonal capture antibody of the ELISA positively 
identified the presence of MHC-bsp/peptide complexes secreted into the 
supernatant by producing transfectants (FIG. 44). Thus, the 7 residue linker 
did not alter protein folding or peptide loading. 

With regard to the purification of the class I-bsp molecule via affinity 
column chromatography, the bsp addition to the molecule perhaps affected 
column loading, as only ~50% recovery of the purified protein was achieved 
when pre-column and post-column protein amounts were compared. A large 
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percentage of this "lost" protein was detected both in the wash, indicating non- 
specific binding to the column during loading, and in the flow-thru, possibly 
indicating that the additional 22 amino acids (15 amino acids in the bsp and 7 
amino acids linking the bsp to the class I, discussed above) on the C-terminus 
of the molecule interfered with loading. If the latter occurred, then ELISA 
results quantifying sHLA-bsp production by T2's may be underestimated, since 
the same W6/32 antibody used as the capture in the ELISA is also coupled the 
column. The elution curve (FIG. 45), produced as a result of the 
spectrophotometric (OD280) measurements of the column system during 
elution, allowed determination of which eluted fractions contained protein. 
These fractions were then pooled and the purified protein was concentrated and 
biotinylated. 

In a 1993 paper, Peter Shatz demonstrated that an array of BirA substrate 
peptide sequences, all having in common a lysine residue at position 9 or 10, 
existed and were capable of being biotinylated. However, some bsp sequences 
biotinylate more robustly than other sequences. It is important, therefore, to 
determine the reaction time required to optimally biotinylate a given protein 
with its specific bsp, so as to not waste the protein in future biotinylation 
reactions. FIG. 46 demonstrates that increasing the incubation time of the 
biotinylation reaction, with all other variables unchanged, allows for greater 
biotinylation. 
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Importantly, purifying the biotinylated class I molecules from any free 
biotin remaining in the reaction mixture was performed prior to making 
tetramers, as free biotin may assume one or more of the four biotin binding 
sites on the avidin molecule in place of a biotinylated class I molecule. FIG. 47 
indicates that some biotinylated protein was lost in this step. However, this 
purification ideally eliminates any "contamination" from monomers, dimers, or 
trimers when making tetramers, optimizing tetramer yield. 

Upon purification of the tetramers via gel filtration (Superdex-200), 
tetramers (MW 498.6 kDa) were separated on the basis of size from any other 
possibly-present molecule contained in the reaction mixture [free avidin-PE 
conjugate (300 kDa— avidin 60 kDa and Phycoerythrin 240 kDa); free 
biotinylated class I (49.65 kDa); unbiotinylated class I-bsp, since biotinylation 
efficiency was not likely 100% (49.4 kDa); free biotin, despite the purification 
step (243 Daltons); monomers (349.65 kDa); dimers (399.3 kDa); or trimers 
(448.94 kDa)]. 

The present use of tetramers as reagents to stain T cells specific to an 
epitope is widely known and utilized. Additionally, the use of tetramers in 
vaccine development and immune modulation is a reality in present biomedical 
research. Several potential applications of tetramers in modulating the human 
immune system include tolerance induction in autoimmune diseases and 
adoptive transfer of antigen-specific T cells in the clinic. If the specific epitope 
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causing the immune response in an autoimmune disease is known, a tetrameric 
complex specific to the patient's HLA type and containing the immunogenic 
epitope could be made and, in theory, given therapeutically to the patient in a 
dose large enough to induce tolerance. Liposomes, which are artificial antigen 
presenting cells that can be made and have incorporated in their membranes 
specific MHC/peptide complexes, are a cousin to the tetramer and would likely 
be used, rather than tetramers, for this type of immune modulation. Adoptive 
transfer of T cells specific to an antigen of a disease-causing agent (tumor or 
viral) could be achieved by using tetramers (specific to the patient and the 
antigen) to sort live T cells, culturing these T cells in vitro to increase their 
numbers, and then transferring the cells back into the patient in hopes of 
enhancing the patient's immune response. 

Thus, as the potential therapeutic use of tetramers in the clinic becomes 
a reality, it is evident that a tetramer molecule containing MHC molecules 
expressed in mammalian cells would more so reflect the protein produced in 
vivo. Although it is not known for certain the significance of the role of MHC 
glycosylation in in vivo intra- and inter-molecular interactions, it fs important 
both in research and clinical therapy to use a system that emulates the actual 
interaction as much as possible. Likewise, it is not known if the class I trimeric 
complex formed inside mammalian cells is structurally identical to that formed 
outside the cell, and for this reason we choose to load sHLA class I molecules 
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inside the cell. In summary, we show that class I molecules can be loaded with 
peptide inside a mammalian cell, secreted from the cell, biotinylated, and that 
these biotinylated class I molecules interact with streptavidin. 

Alternatively, a cell line deficient in peptide processing but still efficient in 
peptide loading may be utilized for both epitope and functional testing, so that 
a putative epitope can be expressed or pulsed into a cell and loaded into the 
HLA molecule in the ER of such cell. The cDNA construct isolated as described 
above may be ligated into a mammalian expression vector which also contains 
a DNA fragment encoding a peptide of interest attached to a fragment encoding 
a signal peptide so that the peptide of interest will be retained in the ER of the 
cell for loading, and such construct transfected into the mammalian cell line 
deficient in peptide processing but which retains the ability to load peptide in 
the HLA molecules, such as the T2 cell line. In this manner, the peptide of 
interest is produced together with the HLA molecule. The soluble HLA molecule 
(with or without a His or biotinylation signal tail) can then be purified and 
utilized as a reagent that has been produced in mammalian cells (fully 
glycosylated, etc.) and is loaded with the single co-transfected peptide. 
Optionally, random oligomers could be made and cloned into such a mammalian 
expression vector, and the soluble HLA molecules could again be purified and 
used to characterize T cells or other immune effector cells. In a further 
alternative, rather than expressing the peptides with the HLA molecule, the cells 
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expressing the HLA molecule could be pulsed with a single synthetic peptide or 
multiple synthetic peptides and analyzed as described above to identify bound 
peptides. Any of the HLA molecule-peptide complexes could be multimerized 
to form dimers, tetramers, etc. and tested for their ability to serve as ligands 
for CTLs and induce immune responses in humans. 

In summary, the method of the present invention involves production of 
MHC class I and class II molecules beginning from gDNA and/or cDNA. The 
gDNA clones encoding a given MHC molecule can be truncated to be secreted 
rather than bound at the cell surface. This truncated version of the MHC 
molecule can be produced in mammalian or insect/bacterial cells such that 
milligram or greater quantities of an individual class I or class II molecule can 
be obtained. The secreted MHC class I molecules can be naturally loaded with 
thousands of endogenous peptides in mammalian cells, while the secreted MHC 
class II molecules can be naturally loaded with thousands of endocytic peptides 
in mammalian cells. Alternatively, the secreted MHC proteins can be produced 
in cells that do not load the MHC molecule with peptide ligand. Production of 
MHC proteins in cells which do not load the MHC molecule with peptide 
facilitates the loading of the MHC molecule via co-transfection with constructs 
encoding a given peptide(s). Alternatively, the MHC peptlde-loading deficient 
MHC transfectant can be pulsed with peptides or DNA encoding peptides. The 
resulting individual secreted MHC molecules are useful for studies of peptide 
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loading (i.e. in vaccine development), for characterizing human immune 
responses to a given MHC molecule loaded with a particular peptide(s), and to 
the development of diagnostics where one needs sufficient MHC protein in order 
to directly assess reactivity to different MHC proteins. 

Another important component of the secreted MHC molecules described 
here is that naturally loaded peptides can be eluted from the MHC molecules 
and characterized. Substantial quantities of peptide can be obtained from 
individual MHC molecules, and the peptides can be selectively characterized. 
Unique information results from having a sufficient supply of eluted peptide, 
and this information is essential to databases and predictive algorithms which 
are essential to the vaccine architect. 

Thus, in accordance with the present invention, there has been provided 
a methodology for producing and manipulating Class I and Class II MHC 
molecules from gDNA as well as the sHLA molecules and their uses that fully 
satisfies the objectives and advantages set forth herein above. Although the 
invention has been described in conjunction with the specific drawings, 
experimentation, results and language set forth herein above, it is evident that 
many alternatives, modifications, and variations will be apparent to those skilled 
in the art. Accordingly, it is intended to embrace all such alternatives, 
modifications and variations that fall within the spirit and broad scope of the 
invention. 
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AH of the numerical and quantitative measurements set forth in this 
application (including in the examples and in the claims) are approximations. 

The invention illustratively disclosed or claimed herein suitably may be 
practiced in the absence of any element which is not specifically disclosed or 
claimed herein. Thus, the invention may comprise, consist of, or consist 
essentially of the elements disclosed or claimed herein. 

The following claims are entitled to the broadest possible scope consistent 
with this application. The claims shall not necessarily be limited to the 
preferred embodiments or to the embodiments shown in the examples. 
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HLA-II DPa (32-40) 
HLA-I a (25-32) 
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GQRKGAGSVF 
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SHAQTWL 
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LPHQPLATY 

AKYSTPATL 

AKAGITTTL 

TQAPGNPVL 
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NQFQALLQY 



60S ribosomal protein L8 (7-16) 
40S ribosomal protein S3 (100-107) 
40S ribosomal protein S4 (73-81) 
40S ribosomal protein S27 (48-55) 
eIF3-p66 (61-69) 
zinc finger transcription factor (296-304) 
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spleen mitotic checkpoint BUB3 (53-60) 
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B*1501 
B*1501 
B*1510 
B*1510 
B*1508 
B*1503 

B+1503 

B*1510 
B*1510 
B*1512 



B*1501, 1508 
B*1501,B*1502, 1508, 

B*1512 
B*1501,B*1502, 1508, 
B*4601,B*1512 
B*1510 

B*1510 
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mammary tumor-associated protein INT6 
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human EST 
human EST 
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B*1510 
B*1510 
B*1512 
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Y 


M 


I 


D 


P 


S 


G 


V 


S 


Y 




B*1 


1501/B*1508/B*1512 


19/20 


560.0 (+2) 


X 


V 


E 


X 


T 


T 


D 


Y 


Y 






B*l 


1501/B*1512 


20/21 


448.2 (+2) 


A 


A 


G 


X 


G 


P 


T 


F 


Y 






B*] 


1501/B*1512 


20/21 


614.0 (+2) 


I 


A 


V 


G 


Y 


V 


D 


D 


T 


Q 


F 


B*] 


I501/B*1512 


21/22 


507.2 (+2) 


V 


A 


F 


V 


X 


F 


V 


G 


Y 






B*] 


1501/B*1512 


21/22 


557.2 (+2) 


Y 


N 


R 


W 


S 


X 


E 


F 








B*] 


1501/B*1512 


22/23 


510.8 (+2) 


A 


L 


M 


P 






X 


N 


Y 






B*l 


I501/B*1512 



TABLE 14 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



GO 
> 

o 

<L> 



CO 

O 



to 

-a 

, t2 <t> 

> ^ 
o 



o 



o 
a 



13 



o 
in 



o 

CM 



IT) 

PQ 



o\o 



O 



CM 



CM 



O 



in 
iii 

CD 

< 



CO 



00 
00 



oo 
o 

PQ 



m 
o 

PQ 



o 
PQ 



171 

SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



motif P2/P9 


+ length variation only 


+ P2 variation only 


DLASMLNRY 


(64-72) 


MQLLCVF 


(1-7) 


DIEGHASHY 


(28- 


36) 


MLNRYKLIY 


(68-76) 


HIJDIEGHASHY 


(26-36) 


SAPLEKQLF 


(123 


-131) 


PLEKQLFYY 


(125- 


133) 


MLSAPLEKQLF 


(121-131) 


APLEKQLFY 


(124 


-132) 


YQLRCHLSY 


(149- 


157) 


PLEKQLF 


(125-131) 


LPNTRPHSY 


(138 


-146) 


ALSINGDKF 


(159- 


167) 


PLEKQLFY 


(125-132) 


NTRPHSYVF 


(14 0 


-148) 


DLPDLRGPF 


(203- 


211) 


TMLPNTRPHSY 


(136-146) 


SINGDKFQY 


(161 


-169) 


FVPNLKDMF 


(242- 


250) 


MLPNTRPHSY 


(137-146) 


YTGAMTSKF 


(169 


-177) 


AVTMTAASY 


(253- 


261) 


QLRCHLSY 


(150-157) 


TSKFLMGTY 


(174 


-182) 


TMFEVSVAF 


(290- 


298) 


YVALS INGDKF 


(157-167) 


LTSAQSGDY 


(216 


-224) 


DLRWLAKSF 


(314- 


322) 


FQYTGAMTSKF 


(167-177) 


YSLVIVTTF 


(224 


-232) 


HLTTEKQEY 


(366- 


374) 


AMTSKFLMGTY 


(172-182) 


V IVTTFVHY 


(227 


-235) 


ALRLATVGY 


(375- 


383) 


HVLSLVF 


(192-198) 


TTFVHYANF 


(238 


-246) 


ALGTESGLF 


(467- 


475) 


SLTSAQSGDY 


(215-224) 


MTAASYARY 


(256 


-264) 


AVSNAVDGF 


(505- 


513) 


SLVIVTTF 


(225-232) 


DTETLTTMF 


(284 


-292) 


ALYEASTTY 


(564- 


572) 


LVIVTTF 


(226-232) 


ATVKGMQSY 


(338 


-346) 


RQIPKIQNF 


(597- 


605) 


IVTTFVHY 


(228-235) 


ATSVLLSAY 


(396 


-404) 


ILSSNYFDF 


(643- 


651) 


I VTTFVHYANF 

**- V ^ JL JL Vlll JT^L^ ^ 


(228-238 ) 


SAYNRHPIjF 


(402 


-410 ) 


TVMEIAGLY 


(666- 


674) 


FVHYANFHNF 


(232-241) 


HTVMRETLF 


(414 


-422) 


HWLAIILY 


(679- 


687) 


FVHYANFHNFY 


(232-242 ) 


ESGLFSPCY 


(471 


-479) 


WLAIILYF 


(680- 


688) 


TMTAASY 


(255-261) 


SPCYLSLRF 


(476 


-484) 


FLVHKIVMF 


(696- 


704) 


TMTAASYARY 


(255-264) 


I I PLINVTF 

•JL <JL JL7 JLJ V JL JL 


(544 


-552 ) 


LiVHKIVMFF 


(697- 


705) 


ELDTETLTTMF 


(282-292) 


TTYLSSSLF 


(570 


-578 ) 








TMFEVSVAFF 


(290-399) 


NS ILSSNYF 


(641 


-649) 








TVLKDIIGICY 


(326-326) 


AIILYFIAF 


(683 


-691) 








VLKDIIGICY 


( 327-326 ) 


F TAFAT.fi T F 


(688 


-696 ) 








TVKGMOSY 


(339-346) 














RLATVGY 


(377-383 ) 














TVGYPKAGVY 


(380-389) 














LLSAYNRHPLF 


(400-410) 














PLHTVMRETLF 


(412-422 ) 














VMRETLF 


(416-422) 














GLALGTE S GT. P 


(465-475) 














GLFSPCY 


(473-479) 














T.MTTPT.TMVTP 


(542-552) 














PLINVTF 


(546-552) 














EVRGSAIjY 


(559-566) 














YLSSSLF 


(572-578) 














TOKSCIF 


(608-614) 














TOKS C I F CGF 


(608-617) 

\ W w W W -JL> / / 














GLETTTY 


(627-633) 














VQNSILSSNY 


(639-648) 














VQNSILSSNYF 


(639-649) 














ILSSNYF 


(643-649) 














VMEIAGLY 


(667-674) 














WLAIILY 


(680-687) 














WLAIILYF 


(680-688) 














VLAIILY 


(681-687) 














VIiAIILYF 


(681-688) 














VLAIILYFIAF 


(681-691) 














IIiYFIAF 


(685-691) 














FLVHKIVMFF 


(696-705) 









TABLE 16 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



i 



I 



<9 



2 



2 

o 



§ 

8 



o 

3 



1 



CO 

§ 

O 



CJ 



PQ 



o 



U 



CD 
GO 



CD 



a 



6 

O 

& 

1 

H 
O 

? 

o 
o 
o 

< 

g 

o 
o 
a 



m 



P-. 



H 
CJ 
O 
O 
O 
O 

s 

s 

o 
o 

s 

I 

& 

i 

O 

o 
o 
u 



o 



I 



o 

s 

o 
o 



S3 

a 

CO 
Ph 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



m 

O 

I 

CO 



CD 

S 
•n 
p-. 



© 
o 
o 

I 

1 



o 

Ph 

H 



O 

3 

1 



1 



Ah 



3 

a 

i 

H 

a 

i 

o 
o 



3" 

8 

a> 

<L> 
00 

>^ 



00 



UJ 
CO 

< 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



Sample 


OD 

260nm 


OD 

280nm 


260nm/28 
Onm 


Dilution 
factor 


Concentrat 
ion ug/ml 


3A394 


0.0346 


0.0202 


1.7111 


20 


34.5821 



TABLE 19 



Sample j 


OD 

260nm 


OD 

280nm 


260nm/28 
Onm 


Dilution 
factor 


Concentrat 
ion ug/ml 


3A394TPC 
1 


0.2821 


0.1505 


1.8739 


20 


282.0960 



TABLE 20 



Sample 


OD 

260nm 


OD 

280nm 


260nm/28 
Onm 


Dilution 
factor 


Concentrat 
ion ug/ml 


3A394TPC 
1 


0.6919 


0.3625 


1.9087 


50 


1729.8492 



TABLE 21 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



Sample 


Decay 
time 

millisecon 
ds 


# live cells/ml 


# dead 
cells/ml 


Viability % 


3A394TPC 
1 


19.8 


1.12x10 s 


1.65x10 s 


87.16 



TABLE 22 



Sample 


Optical 

Density 

492nm 


Dilution 


Concentration of soluble 
HLA ng/ml 


3A394TP 
CI well 1 


1.278 

1.388 (over 
range) 


1.0 


247.270 


3A394TP 
CI 

well 2 


1.227 
1.274 


1.0 


229.855 


3A394TP 
CI 

well 3 


1.021 
1.042 


i.o 


154.403 


3A394TP 
CI 

well 4 


1.108 
1.070 


1.0 


169.001 



TABLE 23 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/USO 1/49744 



Allele 


Allele Allele 
AlleleConcentration by 
ELISA ug/ml 


Concentration by 
ELISA ug/ml 
Concentration by 
ELISA ug/ml 
Concentration hv 
ELISA ug/mlTotal 
amount made mg 


Total amount 
made ma 

Tnt*?il r% mm i nt* 

made ma 
Total amount 
made mqHLA- 

A U jUI 


545.4 


3.47 


HI A A*1 1 m 
rl LA- A 1 1UZ 


QQQ IT 


Z-3 / 


n LA-A ZyUZ 


/D.O 


/.JO 


uii a A>fc*3nm 
n LA-A J5UUZ 




J.JO 


LI 1 A A JkQOn 1 

HLA-A^oZUl 


looz.U 


y.oi 


HLA-A^JoUl 


/inn 


n o 
U.o 


HLA-b^UoUl 


OD.U 


zl.U 


nLA-B*1302 


c c" n 

55-0 


n n 


HI A_R=M zini 
rlLA-D 1S-U1 




DU.U 


HLA-B*1801 


587.6 


0.4 


HLA-B*3701 


1831.0 


119.0 


HLA-B*3801 


128.0 


66.0 


HLA-B*3905 


1400.0 


120.0 


HLA-B*40012 


59.0 


10.0 


HLA-B*4002 


400.0 


180.0 


HLA-B*4101 


288.4 


8.8 


HLA-B*4402 


59.0 


10.0 



TABLE 24 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



fraction 


ion for MS/MS 


derived peptide sequence 


6 


398.2 (+3) 


- -WDRHTXF 


6 


448.2 (+2) 


----- Y T 


7 


382.7 (+2) 


V Q F E A A T 


7 


418.7 (+2) 


AQFASGAGZ 


7 


455.2 (+2) 


ALGA - - R G Y 


7 


489.1 (+2) 


- - V - - G H X Y 


7 


506.8 (+2) 


XS---CEY 


8 


402.2 (+2) 


- G - - C D Y 


8 


419.2 (+2) 


GSHFGVAY 


8 


433.8 (+2) 


APPPPPPPP 


8 


455.2 (+2) 


- - - ZARGY 


8 


462.2 (+2) 


DPHAPPZY 


8 


507.2 (+2) 


AVPSXHXXY 


8 


512.3 (+2) 


XAZVZMTAY 


8 


512.8 (+2) 


ALNGRVTMY 


8 


516.9 (+2) 


NQZHGSAEY 


8 


522.9 (+2) 


FGXACXATSY 


8 


642.7 (+2) 


PMNDWXMTZTY 


9 


331.4 (+3) 


APMARGZY 


9 


418.7 (+2) 


F V S N H A Y 


9 


426.2 (+3) 




9 


433.3 (+2) 


NPPAZZPN 


9 


437.0 (+2) 


T G - - - - AY 


9 


441.2 (+3) 


- Q-DPPPDMZY 


9 


446.6 (+2) 


GQZZAVDF 


9 


453.6 (+2) 


X Q - - AGGZY 


9 


465.2 f+2) 


SQFGGGSQY 


9 


476.2 (+2) 


SQFDHVTY 


9 


481.0 (+2) 


GQHASVXSY 


9 


514.2 (+2) 


- - AAHVPPGY 


9 


550.2 (+2) 


FMDVGAPTVY 


9 


578.0 (+2) 


TPXGEPYZSY 


10 


398.3 (+2) 


X A N - - V T 


10 


448.2 (+2) 


AQAAPFAGY 


10 


448.4 (+2) 


VVVFGVZ F 


10 


450.4 (+2) 


A Q M - - SEY 


10 


456.8 (+2) 


C P L S C F T 


10 


464.7 (+2) 


- - - - FGHY 


10 


473.7 (+2) 


A L W - - P Z F 


10 


486.4 (+2) 


V P H Z N A Y 


10 


498.7 (+2) 


-----GHGGY 


10 


509.0 (+2) 


FLZAMZSTY 


10 


527.7 (+2) 


GQYVVZPTY 


10 


532.0 (+2) 


TVXDSZTHY 


10 


540.2 (+2) 


PMFDPPZTF 


11 


469.2 (+2) 


AQAESLRY 


11 


480.6 (+2) 


XAVGHSGGTY 


11 


511.2 (+2) 


-----PTY 


11 


516.7 (+2) 


ESXPNNVPY 


12 


383.0 (+3) 


LAHTECPRGY 


12 


435.0 (+2) 


-----PSY 


12 


473.2 (+2) 


VQGPVGVQY 



TABLE A 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCTYUS01/49744 



12 


475.0 (+2) 


12 


505.0 (+2) 


12 


513.7 (+2) 


12 


519.0 (+2) 


12 


531.2 (+2) 


12 


541.0 (+2) 


12 


542.2 (+2) 


12 


553.7 (+2) 


12 


577.2 (+2) 


13 


363.0 (+3) 


13 


447.0 (+2) 


13 


465.0 (+2) 


13 


503.6 (+2) 


13 


553.2 (+2) 


13 


590.8 (+2) 


14 


460.7 (+2) 


14 


471.5 (+2) 


14 


495.2 (+2) 


14 


525.3 (+2) 


14 


541.3 (+2) 


14 


550.4 (+2) 


14 


577.2 (+2) 


15 


459.8 (+2) 


15 


470.9 (+2) 


15 


514.2 (+2) 


15 


536.6 (+2) 


15 


544.2 (+2) 


15 


564.2 (+2) 


16 


467.1 (+2) 


16 


500.4 (+2) 


16 


501.0 (+2) 


16 


503.6 (+2) 


16 


517.4 (+2) 


16 


521.3 (+2) 


16 


522.0 (+2) 


16 


525.6 (+2) 


16 


536.0 (+2) 


16 


544.4 (+2) 


16 


557.6 (+2) 


16 


561.5 (+2) 


16 


596.7 (+2) 


16 


596.9 (+2) 


17 


343.2 (+3) 


17 


351.3 (+3) 


17 


408.7 (+2) 


17 


441.2 (+2) 


17 


451.4 (+2) 


17 


455.0 (+2) 


17 


462.4 (+2) 


17 


483.8 (+2) 


17 


490.2 (+2) 


17 


497.2 (+2) 


17 


507.0 (+2) 


17 


511.2 (+2) 


17 


519.2 (+2) 



R 


G 


X 


G 


V 


A 


G 


T 


A 


F 


T 


G 


A 


P 


V 


S 


E 


E 


G 


Y 


V 


Q 


X 


Y 


Y 


G 


S 


V 


V 




E 


P 


A 


M 


V 


X 


Z 


C 


F 




G 


Q 


P 


G 


A 


P 


X 


G 


G 


Z 


G 


P 


P 


H 


N 


G 


X 


R 


A 


Y 


A 


A 


H 


W 


H 


V 


E 


A 


Y 




T 


P 


P 


T 


R 


R 


E 


S 


Y 




F 


P 


T 


D 


R 


R 


S 


Q 


F 




Y 


T 


G 


V 


S 


Y 


X 


H 


F 




A 


Q 


A 


s 


A 


P 


D 


A 


Y 




V 


Q 


Y 


Y 


X 


P 


F 








G 


Q 


R 


K 


G 


A 


G 


S 


V 


F 


X 


Q 


Z 


X 


- 


- 


D 


V 


Y 




A 


T 


G 


T 


A 


Z 


N 


X 


N 


Z 


V 


V 


A 


P 


I 


T 


T 


G 


Y 




V 


V 


A 


C 


V 


- 


- 


- 


Y 




X 


Q 


Y 


T 


V 


G 


Y 


F 






P 


L 


A 


- 


N 


- 


H 


T 


Y 




P 


L 


F 


G 


Q 


T 


A 


G 


Q 


Y 


A 


- 


- 


- 


- 


Q 


X 


E 


Y 




Z 


G 


Y 


G 


N 


P 


X 




G 


A 


V 


Q 


G 


P 


V 


G 


T 


D 


F 




V 


A 


G 


G 


w 


- 


_ 


- 


F 




F 


Q 


A 


R 


X 


T 


E 


Y 






X 


A 


G 


F 


F 


X 


X 


E 


Y 




X 


Q 


- 


_ 


- 


- 


z 


Y 






S 


G 


A 


X 


D 


R 


A 


Y 


Z 


F 


F 


Q 


- 


_ 


- 


- 


T 


X 






T 


P 


- 


_ 


- 


A 


Z 


A 


F 




V 


V 


A 


T 


Z 


N 


Z 


Z 


X 




Y 


M 


V 


T 


- 


- 


- 


F 






A 




G 


S 


z 


A 


X 


M 


P 


F 


A 


P 


A 


V 


- 


- 


- 


V 


G 


Y 


V 


G 


Y 


V 


D 


D 


T 


Q 


F 
















T 


G 


F 




P 


V 


P 


N 


V 


R 


X 


N 


Y 
















T 


X 


S 


X 


T 


It 


E 


G 


W 


M 


S 


Z 


Y 




Y 


M 


V 


C 


N 


A 


E 


E 


Y 














X 


R 


D 


X 


Y 


S 


L 


X 












F 




A 


Q 


H 


P 


S 


A 


X 


R 


F 




A 


A 


F 


c 


G 


- 


- 


- 


X 


V 


Y 


Xj 


H 


_ 


_ 


E 


T 


















Z 


A 


Y 






I 


L 


G 


P 


P 


G 


S 


V 


Y 




G 


L 


G 


z 


T 


S 


A 


E 


F 




X 


D 


G 


D 


V 


N 


M 


Y 






V 


M 


G 


X 


T 


N 


A 


N 


F 




N 


A 


X 


G 


R 


E 


S 


S 


F 




A 


M 


N 


P 


T 


N 


T 


V 


F 












X 


V 


E 


F 






X 


Q 


A 


P 


A 


X 


F 


V 


Y 




T 


A 


R 


V 


X 


S 


V 


E 


Y 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/USO 1/49744 



17 


526.8 (+2) 


A L F - - - F 


T 


Y 




17 


542.8 (+2) 


X Q X N A Y X 


S 


Y 




17 


563.2 (+2) 


G L A R C S Z 


V 


E 


Y 


18 


503.8 (+2) 


S Q X A A G V 


D 


V 


F 


18 


511.7 (+2) 


P Q G 2 M A - 


- 


Y 




18 


519.6 (+2) 


- V F V S H T 


T 


F 




18 


538.8 (+2) 


H X T G N E A 


T 


S 


F 


18 


565.7 (+2) 


A E F W A C Z 


X 


Y 




18 


566.2 (+2) 


Y M I D P S G 


V 


S 


Y 


18 


581.2 (+2) 


X Q G H H E M 


F 


Y 




20 


448.2 (+2) 


A A G X G P T 


F 


Y 




20 


560.0 (+2) 


X V E X T T D 


Y 


Y 




20 


614.0 (+2) 


I A V G Y V D 


D 


T 


Q F 


21 


507.2 (+2) 


V A F V X F V 


G 


Y 




22 


510.8 (+2) 


A L M P - - X 


N 


Y 




22 


557.2 (+2) 


Y N R W S X E 


F 






24 


546.3 (+2) 


- - Z D R N V 


T 


F 




25 


546.3 (+2) 


V V T M - - - 


Z 


Y 





* Dashes represent positions at which amino acids could not be unambiguously assigned through 
NanoES-MS/MS fragmentation patterns and/or Edman data obtained. Underlined residues designate 
tentative assignments. 

TABLE A cont'd. 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCTYUSO 1/49744 



fraction 


ion for MS/MS 


derived DeDtide seauence 


6 


471.8 (+2) 


AZVECETY 


7 


418.7 (+2) 


AQFASGAGZ 


7 


504.2 (+2) 


ZGXGGGPATSY 


8 


402.2 (+2) 


- G - - C D Y 


8 


441.2 (+2) 


____„2SF 


8 


516.9 (+2) 


NQZHGSAEY 


9 


33 1.4 (+3) 


APMARGZY 


9 


349.4 (+3) 


------GFY 


9 


418.7 (+2) 


AZVNSG-Y 


9 


426.2 (+3) 


AASSZV - - PPZY 


9 


433.3 (+2) 


NPPAZZPN 


9 


437.0 (+2) 


ACGGCGZDY 


9 


441.2 (+3) 


-Z-DPPPDMZY 


9 


446.6 (+2) 


GQZZAVDF 


9 


578.0 (+2) 


TPXGEPYZSY 


10 


426.5 (+2) 


GP---PZY 


10 


443.2 (+2) 


APZYPPPP 


10 


448 3 (+2} 


GZVCTPGSF 


10 


456 8 (+2^ 


C P L S C F T 


10 




SOFGGGSOY 


10 


465 4 f+2^ 


ASGFNGSZY 


10 


503 8 f+2^ 


-Z--YTAY 


10 


JuO. / V. • ^ J 


a v, p p r m n T? v 

V3 £J IT XT XI IN U £ X 


10 




T KADHVSTY 


10 


527 7 


X ZADHVXPY 


10 


S40 7 


----PGZVY 


10 




7. Q \7" _ _ _ «7 T 1 n V" 


11 


437 0 f+2^ 


HXGNQAAY 


1 1 


5 1 1 .4 (+2) 


ZAGTTVPVSY 


1 1 


527 4 H-2^ 


GOYPTOPTY 

\^ A AT J* *T a a 


1 1 

1 X 


J O 1 .*T ^ * ^ 


"PARS ^^NTT^TV 
n vj *j &j o xv x o x X 


12 


494.8 (+2) 


SZGG - - - TGY 


12 


526 8 (+2) 


ZGPPNYXTY 


12 


547.1 (+2) 


VKVIQQESY 


13 


454.6 (+2) 


LPPPPPPPP 


13 


476 0 \ 


AKYSTPATL 

XV A U A- AT X AJ 


13 


503 6 (+Ti 


GQRKGAGSVF 


13 


513.1 (+2) 


R Z SANHEAX 


13 


526.4 (+2) 


GKVRTDITY 


13 


553.2 (+2) 


VVXPAVRSTY 


13 


561.0 (+2) 


AKYPHVEDY 


13 


571.3 (+2) 


A Z NX S A Y V X Y 


13 


601.2 (+2) 


EVVGDTZY 


14 


438.2 (+2) 


AKAGITTTL 


14 


490.8 (+2) 


V - - TZAGSAF 


14 


517.2 (+2) 


AZAAANVXXY 


14 


531.5 (+2) 


ANHSVRDTY 


14 


535.3 (+2) 


E---GXRZY 


14 


552.8 (+2) 


XZHNDZSTY 


14 


577.2 (+2) 


ANEZXG---Y 



TABLE B 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



15 


497.3 (+2) 


A A G P 


T 


A 


Z 


E 


S 


Y 


15 


514.2 (+2) 


V A G X 


V 


F 


M 


Z 


Y 




15 


527.0 (+2) 


A Z Y 2 


A 


Z 


V 


V 


F 




15 


564.2 (4-2) 


A 2 F - 


- 


- 


Z 


X 


Y 




15 


577.2 (+2) 


Z G Y G 


N 


P 


X 


N 


Z 


Y 


15 


626.0 (4-2) 






2 


A 


P 


C 


H Y 


16 


521.6 (+2) 


A H A V 


Q 


R 


V 


V 


Y 




16 


525.6 (4-2) 


T Z X T 


V 


V 


X 


N 


Y 




17 


446.8 (4-2) 


A Z Z A 


S 


G 


X 


A 


F 




17 


492.8 (+2) 


G S H S 


M 


R 


Y 


F 






17 


503.8 (4-2) 


Y G Y G 


A 


T 


V 


E 


F 




17 


967.6 (4-1) 


V Z - - 




T 


T 


F 






18 


451.4(4-2) 


Q P G P 


Q 


I 


V 


Y 






18 


455.2 (4-2) 


N G Z X 


s 


N 


N 


Y 






18 


475.2 (+2) 


A N X V 


z 


X 


E 


Y 






18 


489.1 (+2) 


G Z - - 




Z 


G 


X 


X 


Y 


18 


497.8 (4-2) 


A M N P 


T 


N 


T 


V 


F 




18 


525.2 (4-2) 


Y N - - 




Z 


X 


F 






18 


538.8 (+2) 


-M--SYZNF 




18 


565.7 (4-2) 


A E F W 


A 


C 


Z 


X 


Y 




19 


521.6 (+2) 


S Z F G 


C 


P 


T 


R 


F 




19 


524.6 (4-2) 


X G A X 


S 


N 






E 


F 


19 


571.2 (+2) 


R Z A A 


Y 


R 


X 


T 


Y 




19 


646.2 (4-2) 


T N X H 


D 


G 


D 


G 


A 


T Z Y 



* Dashes represent positions at which amino acids could not be unambiguously assigned through 
NanoES-MS/MS fragmentation patterns and/or Edman data obtained. Underlined residues designate 
tentative assignments. 

TABLE B cont'd 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT7US0 1/49744 



fraction 


| ion for MS/MS 


derived peptide sequence 


6 


398.2 (+3) 


- - WDRHTXF 


6 


448.2 (+2) 


----- Y T 


8 


419.2 (+2) 


GSHFGVAY 


8 


441.2 (+2) 


VPCGZZSY 


8 


473.2 (+2) 


TAZXHRGY 


8 


512.8 (+2) 


XAZYEHTY 


8 


516.9 (+2) 


NQZHGSAEY 


8 


546.8 (+2) 


NGXAMHWTY 


9 


418.7 (+2) 


F V S N H A Y 


9 


437.0 (+2) 


TG----AY 


9 


465.2 (+2) 


SQFGGGSQY 


9 


476.2 (+2) 


SQFDHVTY 


9 


481.0 (+2) 


-P--GZDEV 


9 


514.2 (+2) 


NGYDGPNAGY 


9 


578.0 (+2) 


TPXGEPYZSY 


10 


398.3 (+2) 


X A N - - V T 


10 


448.3 (+2) 


MPHSGYGF 


10 


450.4 (+2) 


V D X - * - Y 


10 


456.8 (+2) 


C P L S C F T 


10 


464.7 (+2) 


-----PGFY 


10 


486.2 (+2) 


-A-PHPMGY 


10 


494.2 (+2) 


AQTVGYGEY 


10 


508.7 (+2) 


------SVY 


10 


509.0 (+2) 


FLZAMZSTY 


10 


532.0 (+2) 


TVXDSZTHY 


11 


444.1 (+2) 


TP - - ARAPT 


11 


469.2 (+2) 


S E H D R M Y 


11 


480.6 (+2) 


TGNCSGTGTY 


11 


496.8 (+2) 


AQVNPSXTY 


11 


532.3 (+2) 


S PGAETRAXY 


12 


473.2 (+2) 


YLG---GAF 


12 


494.8 (+2) 


XTSFMZVY 


12 


499.0 (+2) 


-P---PSSGY 


12 


505.0 (+2) 


TP---GRMY 


12 


513.7 (+2) 


PMFDZZVY 


12 


519.0 (+2) 


Y L - - - RTF 


12 


531.2 (+2) 


AQEHGCAAZF 


12 


542.2 (+2) 


-M---GVHDY 


12 


550.2 (+2) 


YVS--RNQY 


12 


553.7 (+2) 


AQYAAGES FY 


12 


564.0 (+2) 


TPHTZHDEY 


12 


565.2 (+2) 


YM---FMY 


13 


396.1 (+3) 


DPHYVSGHZ F 


13 


401.2 (+2) 


MVGXXPAT 


13 


526.4 (+2) 


ZASPGEXTSY 


14 


460.7 (+2) 


VVAPITTGY 


14 


471.5 (+2) 


VVACV---Y 


14 


525.3 (+2) 


PLA-N-HTY 


14 


543.2 (+2) 


XAXYRRMY 


14 


550.4 (+2) 


PLAMZXYTY 



TABLE C 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/USO 1/49744 



15 


460.6 (+2) 


- 


P 


- 


M 


P 


G 


X 


A 


Y 




15 


461.0 (+2) 


H 


T 


T 


S 


Z 


N 


A 


Y 






15 


506.0 (+2) 


M 


A 


A 


M 


V 


G 


V 


A 


V 


Y 


15 


508.4 (+2) 


G 


P 


Z 


V 


M 


Z 


H 


G 


Y 




15 


514.2 (+2) 


F 


Q 


A 


R 


X 


T 


E 


Y 






15 


520.0 (+2) 


L 


P 


H 


Q 


P 


L 


A 


T 


Y 




15 


525.2 (+2) 


A 


A 


A 


X 


V 


- 


- 


- 


V 


T Y 


15 


536.6 (+2) 


X 


P 


E 


M 


G 


Z 


F 


S 


Y 




15 


544.2 (+2) 


Y 


V 


- 


- 


V 


R 


- 


V 


F 




15 


564.2 (+2) 


F 


V 


T 


X 


N 


X 


E 


E 


Y 




16 


489.0 (+2) 


A 


A 


P 


V 


G 


A 


X 


E 


S 


Y 


16 


500.4 (+2) 


G 


S 


- 


- 


- 


S 


Y 


T 


Y 




16 


522.0 (+2) 


V 


G 


Y 


V 


D 


D 


T 


Q 


F 




16 


525.7 (+2) 


Y 


V 


A 


- 


- 


- 


P 


A 


F 




16 


533.0 (+2) 


V 


G 


Y 


- 


- 


A 


H 


P 


G 


F 


16 


535.7 (+2) 


Z 


A 


T 


N 


S 


V 


T 


S 


T 


Y 


16 


537.0 (+2) 














S 


T 


Y 




16 


545.8 (+2) 


Y 


A 


T 


A 


G 


E 


M 


M 


A 


F 


16 


547.0 (+2) 


S 


P 


T 


Y 


T 


H 


A 


V 


A 


F 


16 


557.0 (+2) 


M 


P 


A 


- 


- 


M 


V 


M 


A 


F 


17 


351.3 (+3) 


A 


A 


F 


C 


G 


- 


- 


- 


X 


V 


17 


393.7 (+2) 


S 


P 


N 


E 


D 


X 


M 


Z 


V 


F 


17 


403.2 (+2) 


V 


A 


A 


T 


A 


G 


A 


V 


F 




17 


408.7 (+2) 


Y 


L 


H 


- 


- 


E 


T 








17 


414.8 (+2) 


T 


A 


F 


P 


F 


V 


F 








17 


45 1.4 (+2) 


I 


L 


G 


P 


P 


G 


S 


V 


Y 




17 


462.4 (+2) 


X 


I* 


G 


D 


V 


N 


M 


Y 






17 


476.2 (+2) 


Y 


G 


- 


- 


- 


V 


X 


s 


M 




17 


490.8 (+2) 


X 


P 


H 


C 


S 


C 


S 


s 


F 




17 


504.0 (+2) 


D 


P 


P 


c 


w 


G 


V 


s 


F 




17 


507.0 (+2) 


- 


- 


- 


- 


X 


V 


E 


F 






17 


51 1.2 (+2) 


- 


- 


- 


- 


A 


H 


D 


A 


Y 




17 


519.2 (+2) 


T 


A 


R 


V 


X 


S 


V 


E 


Y 




17 


526.8 (+2) 


X 


S 


D 


G 


R 


Z 


X 


T 


Y 




17 


542.8 (+2) 


N 


M 


N 


D 


L 


V 


S 


E 


Y 




17 


557.2 (+2) 


M 


P 


A 


A 


D 


Y 


E 


V 


A 


F 


IS 


474.8 (+2) 


A 


E 


I 


L 


Q 


V 


I 


Y 






18 


503.8 (+2) 


A 


P 








X 


V 


S 


Y 




18 


514.7 (+2) 


M 


P 


A 


G 


Y 


N 


N 


V 


Y 




18 


519.6 (+2) 


Y 


M 


S 


G 


X 


Y 


G 


T 


F 




18 


526.8 (+2) 








A 


y 


V 


A 


Z 


S 


Y 


18 


538.8 (+2) 


X 


P 


V 


V 


p 


A 


A 


Z 


T 


Y 


18 


566.2 (+2) 


Y 


M 


I 


D 


p 


S 


G 


V 


S 


Y 


18 


616.3 (+2) 


F 


A 


N 


G 


V 


Z 


G 


c 


A 


F A F 



* Dashes represent positions at which amino acids could not be unambiguously assigned through 
NanoES-MS/MS fragmentation patterns and/or Edman data obtained. Underlined residues designate 
tentative assignments. 

TABLE C cont'd- 
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SUBSTITUTE SHEET (RULE 26) 



PCT7US0 1/49744 



iracuon 


inn -fXr A/fQ/A/fC 

ion ior xvio/ivjo 


derived peptide sequence 


6 


493.0 (+2) 


N H 


A 


V 


G 


- 


- 


V 


S M 


6 


557.8 (+2) 


H N 


V 


F 


Z 


P 


T 


S 


N A 


7 


434.8 (+2) 


S V 


C 


E 


T 


E 


S 


X 




7 


iOI o / i 

481.3 (+2) 


T H 


P 


S 


Z 


A 


C 


A 


F 


7 


489.1 


- H 


- 


- 


S 


P 


X 


X 




o 

o 


4ZU.1 


A N 


X 


E 


G 


P 


H 


T 




o 
5 


A A t T / t 0\ 

441.7 (+2) 


G H 


s 


P 


P 


T 


S 


S 


L 


o 
O 


494.0 


C H 


s 


A 


F 


A 


L 






o 
5 


Jll.O ) 


H H 


A 


F 


A 


Z 


V 


X 


V 


o 
5 


C 1 ft >l /*_l_T\ 

519.4 (+2) 


D H 


Y 


Y 


X 


A 


G 


s 


X 


ft 

9 


41 1.4 i+2) 


E X 


A 


P 


H 


A 


A 


X 




9 


AO A *2 /_L/7\ 

4Z4.J 


A A 


A 


X 


R 


C 


E 


X 




9 


AOC 1 

42t>. 1 


G H 


Z 


A 


P 


A 


A 


s 


X 


9 


441. / i+zj 


V H 


N 


P 


Z 


S 


S 


X 




ft 
9 


444 J. 


A G 


G 


P 


T 


X 


C 


R 


X 


9 


/JCC £ A i TV 

455. j 


L H 


L 


L 


T 


L 


E 


A 




9 


490.3 (+2) 


A G 


G 


Z 


P 


A 


T 


P 


PAX 


9 


513.1 (+2) 


S H 


Z 


G 


C 


V 


Z 


P 


A V 


9 


520.0 (+2) 


X H 


R 


Li 


C 


S 


P 


T 


X 


10 


404.2 (+2) 


S V 


S 


X 


P 


H 


A 


P 




10 


417.1 (+2) 


A P 


F 


T 


G 


G 


N 


G 


X 


10 


433.8 (+2) 


G H 


D 


P 


D 


S 


P 


A 


A 


10 


446.2 (+2) 


E H 


G 


X 


E 


N 


G 


H 




10 


455.4 (+2) 


E H 


V 


A 


S 


S 


P 


A 


L 


10 


460.4 (+2) 


H H 


A 


P 


C 


G 


V 


S 


X 


10 


464.0 (+2) 


N H 


A 


I 


V 


S 


T 


s 


V 


10 


ASA 1 S 1 

464.7 (+2) 


G H 


Z 


N 


s 


V 


T 


s 


V 


10 


465.3 (+2) 


S H 


Z 


A 


p 


c 


T 


s 


V 


10 


469.4 (+2) 


F V 


A 


R 


F 


V 


S 


X 




10 


469.6 (+2) 


H H 


S 


D 


G 


s 


V 


s 


L 


10 


473.7 (+2) 


S H 


A 


G 


A 


p 


P 


P 


T X 


10 


482.6 (+2) 


M C 


Z 


- 


G 


M 


p 


A 


X 


10 


482.8 (+2) 


G H 


G 


A 


N 


N 


D 


P 


A X 


10 


495.7 (+2) 


X H 


S 


Z 


P 


A 


G 


P 


A X 


10 


508.3 (+2) 


X H 


V 


V 


S 


- 


- 


V 


X 


10 


511.2 (+2) 


A V 


X 


V 


C 


C 


Z 


V 


A V 


10 


522.3 (+2) 


E X 


G 


G 


N 


T 


N 


P 


Z X 


10 


522.7 (+2) 


Y H 


G 


S 


Z 


N 


P 


E 


X 


1U 


joy.u \T** ) 










T 


Y 


S 


Y 


10 


574.3 (+2) 
















M 


11 


405.7 (+2) 


S H 








Y 


F 






11 


425.8 (+2) 


A H 


P 


D 


Z 


A 


X 


V 




11 


444.7 (+2) 


G T 


A 


H 


Y 


Z 


V 


X 




11 


448.9 (+2) 


M H 


A 


D 


N 


P 


V 


X 




11 


455.7 (+2) 


S H 


V 


D 


R 


P 


s 


X 




11 


459.7 (+2) 


T G 


A 


A 


F 


z 


N 


p 


X 


11 


482.8 (+2) 


G H 


C 


P 


R 


N 


P 


A 


X 


11 


495.7 (+2) 


X H 


S 


G 


A 


P 


Z 


A 


P X 


11 


516.7 (+2) 


X H 


D 


T 


E 


H 


A 


P 


X 


11 


562.3 (+2) 






Y 


% 


A 


Y 


V 


Y 



TABLE D 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



12 


41 1.7 (+2) 


G H 


G P 


T 


X A A V 


12 


428.8 (+2) 


V P 








12 


444.7 (+2) 


Y Q 


H T 


G 


A V L 


12 


448.4 (+2) 


T Q 


A P 


G 


N P V li 


12 


460.3 (+2) 


T Z 


A G 


C 


M V P X 


12 


490.9 (+2) 


T H 


T Q 


P 


G V Q L 


12 


507.4 (+2) 


G H 


A G 


H 


V P P E X 


12 


51-1.6 )+2) 


T H 


F R 


Y 


V S X 


12 


528.1 (+2) 


E H 


R P 


D 


R V F 


13 


427.6 (+2) 


S H 


A Q 


T 


V V L 


13 


449.2 (+2) 


S H 


A N 


S 


A V V L 


13 


464.8 (+2) 


M V 


- _ 


H 


P V X 


13 


487.6 (+2) 


y h 


H G 


G 


V S A F 


13 


506.2 (+2) 


- H 


- - 


G 


H T G Y X 


14 


420.1 (+2) 


N H 


A N 


G 


L T L 


14 


438.7 (+2) 








P X X 


14 


456.7 (+2) 


A H 


S V 


P 


S P A F 


14 


477.7 (+2) 


M H 


T - 


- 


P A P V 


14 


482.8 (+2) 


- P G 


A A 


V 


V P S X 


14 


560.8 (+2) 


G H 


A G 


M 


G C V F Z X 


14 


592.3 (+2) 


M R 


- - 


- 


- G X E X 


15 


418.9 (+2) 


S H 


G V 


P 


RAX 


15 


439.0 (+2) 


E H 


H M 


P 


X X 


15 


454.3 (+2) 


H H 


Z C 


A 


A G A X 


15 


492.1 (+2) 


X V 


D Z 


A 


E P X V 


15 


510.1 (+2) 


I H 


T P 


E 


N P V I 


15 


520.0 (+2) 


M G 


X P 


V 


R H M V 


15 


524.2 (+2) 


S H 


Y D 


W 


Z V X 


15 


532.9 (+2) 


M P 


H S 


H 


P F V X 


15 


577.2 (+2) 


Z C 


V R 


C 


Z N G V F 


16 


412.9 (+2) 


S H 


A G 


A 


G X V X 


16 


418.3 (+2) 


G H 


X E 


G 


P X X 


16 


424.3 (+2) 


X H 


G G 


D 


H V X 


16 


448.6 (+2) 


E Z 


A H 


S 


X V X 


16 


448.9 (+2) 


Y H 


H D 


X 


V X 


16 


454.3 (+2) 


M A 


G A 


W 


C R X 


16 


456.7 (+2) 


S H 


D G 


s 


V P T X 


16 


464.2 (+2) ' 


F H 


- 


X 


X X 


16 


469.9 (+2) 


E H 


- - 


- 


T V X 


16 


472.3 (+2) 


M A 


X - 


- 


- _ v V 


16 


499.0 (+2) 


G H 


A X 


T 


D G X T X 


16 


504.1 (+2) 


P V 


S H 


X 


V N E L 


16 


507.7 (+2) 


X X 


Y T 


p 


G H T X 


16 


522.7 (+2) 








- P V X 


16 


523.3 (+2) 


M A 


H S 


- 


- P V F 


16 


529.9 (+2) 


X H 


Y D 


R 


N Q X 


16 


536.2 (+2) 


E A 




C 


Z V T T Y 


16 


547.9 (+2) 








- A X S V 


16 


552.4 (+2) 


X Z 


A P 


T 


S V F Z X 


17 


367.7 (+3) 


F T 


M P 


A 


H P S T X 


17 


490.8 (+2) 


M T 


X G 


Y 


G E P X 


17 


557.3 (+2) 


A H 


G R 


K 


M S K S L 


17 


340.7 (+3) 


- H 




H 


A Z V X 


17 


367.7 (+3) 






R 


X S H X 


17 


419.8 (+2) 




- H 


A 


V G X X 



TABLE D cont'd. 
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SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



17 


462.8 (+2) 


M 


S 


S 


N 


E 


X 


X 


M 






17 


476.2 (+2) 


G 


H 


- 


- 


- 


- 


P 


C 


c 




17 


504.2 (+2) 


X 


H 


V 


X 


A 


V 


N 


E 


X 




17 


523.2 (+2) 


- 


H 












P 


V 


F 


17 


543.2 (+2) 


X 


H 


E 


V 


Z 


P 


H 


X 


X 




17 


590.2 (+2) 


A 


T 


E 


H 


c 


F 


y 


M 


E 


X 


18 


456.4 (+2) 


A 


H 


S 


N 


L 


A 


s 


V 


L 




18 


463.3 (+2) 


V 


X 


A 


P 


A 


N 


D 


X 


X 




18 


474.8 (+2) 


M 


X 


G 


X 


S 


F 


p 


A 


X 




1 o 
15 


491.2 (+2) 


V 


H 


T 


c 


V 


N 


p 


V 


X 




18 


497.8 (+2) 


S 


H 


Q 


R 


Q 


L 


II 


L 






18 


515.8 (+2) 


E 


W 


H 


Y 


p 


V 


s 


X 






18 


5 19.7 (+2) 


F 


H 


M 


D 


X 


Z 


T 


F 






18 


543.4 (+2) 


X 


H 


E 


V 


z 


P 


H 


X 


X 




18 


596.8 (+2) 


F 


H 


H 


T 


z 


S 


N 


P 


X 


X 


19 


434.6 (+2) 




H 


G 


C 


p 


G 


M 


P 


X 




19 


496.6 (+2) 


E 


T 


P 


E 


H 


A 


P 


V 


X 




19 


539.6 (+2) 


M 


X 


P 


G 


N 


S 


A 


X 


Y 


X 



* Dashes represent positions at which amino acids could not be unambiguously assigned through 
NanoES-MS/MS fragmentation patterns and/or Edman data obtained. Underlined residues designate 
tentative assignments. 

TABLE D cont'd. 
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fraction 


ion for MS/MS 


derived peptide sequence 


7 


504.1 (+2) 


HMSGZPTSY 


7 


549.2 (+2) 


HNZAAHZEY 


8 


517.0 (+2) 


NQZHGSAEY 


8 


526.0 (+2) 


HAAXYSZVY 


8 


642.7 (+2) 


PMNDWXMTZTY 


10 


465.3 (+2) 


SQFGGGSQY 


10 


484.3 (+2) 


Y Q S D H R Y 


10 


509.0 (+2) 


FLZAMZSTY 


10 


532.0 (+2) 


TVXDSZTHY 


11 


424.3 (+2) 


H X S T Z D F 


11 


464.3 (+2) 


HAPPTDPPP 


11 


550.0 (+2) 


HGPANRDSVF 


11 


563.3 (+2) 


FPYPTDPZY 


12 


531.2(+2) 


KNANLVQLY 


14 


585.6 (+2) • 


RSFXXENEY 


16 


488.7 (+2) 


HMZNPTSY 


16 


661.9 (+2) 


YVXF----VY 


17 


577.6 (+2) 


RSMXRCPEY 


18 


451.1 (+2) 


ILGPPGSVY 


18 


523.0 (+2) 


- - FVTAZTY 


19 


565.6 (+2) 


YMIDPSGVSY 


19 


503.8 (+2) 


SQXAAGVDVF 


20 


560.0 (+2) 


XVEXTTDYY 


20 


582.4 (+2) 


MYNCNEXDY 


21 


448.2 (+2) 


AAGXGPTFY 


21 


614.0 (+2) 


IAVGYVDDTQF 


22 


507.2 (+2) 


VAFVXFVGY 


22 


557.2 (+2) 


YNRWSXEF 


23 


510.8 (+2) 


ALMP - - XNY 


25 


562.8 (+2) 


NQFQALLQY 



* Dashes represent positions at which amino acids could not be unambiguously assigned through 
NanoES-MS/MS fragmentation patterns and/or Edman data obtained. Underlined residues designate 
tentative assignments. 
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What I claim is: 

1. A method for the production of soluble HLA molecules in a cell pharrn 
having an appropriate growth media therein, comprising the steps of: 

isolating HLA allele mRNA from a source and reverse transcribing 
the mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA by PCR, wherein the amplification 
utilizes at least one locus-specific primer having a stop codon 
incorporated into a 3' primer thereby resulting in a truncated PCR 
product having the coding regions encoding cytoplasmic and 
transmembrane domains of the allelic cDNA removed such that the 
truncated PCR product has a coding region encoding a soluble HLA 
molecule; 

inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product 
having the coding region encoding a soluble HLA molecule; 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating the cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces soluble HLA molecules. 
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2. The method according to claim 1, further comprising the step of 
harvesting the soluble HLA molecules from the cell pharm. 

3. The method according to claim 1 wherein the soluble HLA molecules are 
Class I HIA molecules or Class II HLA molecules. 

4. The method according to claim 1 wherein, in the step of isolating HLA 
allele mRIMA from a source, the source is selected from the group 
consisting of mammalian DNA and an immortalized cell line. 

5. The method according to claim 1 wherein, in the step of inserting the 
truncated PCR product into a mammalian expression vector, the 
mammalian expression vector contains a promoter that facilitates 
increased expression of the truncated PCR product. 

6. The method according to claim 1 wherein, in the step of electroporating 
the plasmid containing the truncated PCR product into at least one 
suitable host cell, the suitable host cell lacks expression of Class I HLA 
molecules. 

7. A method for the production of soluble HLA molecules in a cell pharm 
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having an appropriate growth media therein, comprising the steps of: 

isolating HLA allele mRNA from a source and reverse transcribing 
the mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA by PCR 7 wherein the amplification 
utilizes at least one locus-specific primer that truncates the allelic 
cDNA, thereby resulting in a truncated PCR product having the 
coding regions encoding cytoplasmic and transmembrane domains 
of the allelic cDNA removed such that the truncated PCR product 
has a coding region encoding a soluble HLA molecule; 
inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product 
having the coding region encoding a soluble HLA molecule; 
electroporating the plasmid containing the truncated PCR product 
int& at least one suitable host cell; and 

inoculating the cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces soluble HLA molecules. 

8. The method according to claim 7, further comprising the step of 
harvesting the soluble HLA molecules from the cell pharm. 
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9. The method according to claim 7 wherein the soluble HLA molecules are 
Class I HLA molecules or Class II HLA molecules. 

10. The method according to claim 7 wherein, in the step of isolating HLA 
allele mRNA from a source, the source is selected from the group 
consisting of mammalian DNA and an immortalized cell line. 

11. The method according to claim 7 wherein, in the step of amplifying the 
allelic cDNA by PCR, the locus-specific primer includes a sequence 
encoding a tail such that the soluble HLA molecule encoded by the 
truncated PCR product contains a tail attached thereto that facilitates in 
purification of the soluble HLA molecules produced therefrom. 

12. The method according to claim 7 wherein, in the step of inserting the 
truncated PCR product into a mammalian expression vector, the 
mammalian expression vector contains a promoter that facilitates 
increased expression of the truncated PCR product. 

13. The method according to claim 7 wherein, in the step of electroporating 
the plasmid containing the truncated PCR product into at least one 
suitable host cell, the suitable host cell lacks expression of Class I HLA 
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molecules. 

14. A method for the production of soluble HLA molecules in a cell pharm 
having an appropriate growth media therein, comprising the steps of: 

isolating HLA allele mRNA from a source and reverse transcribing 
the mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA by PCR, wherein the amplification 
utilizes at least one locus-specific primer that truncates the allelic 
cDNA, thereby resulting in a truncated PCR product having the 
coding regions encoding cytoplasmic and transmembrane domains 
of the allelic cDNA removed such that the truncated PCR product 
has a coding region encoding a soluble HLA molecule; 
inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product 
having the coding region encoding a soluble HLA molecule; " 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating the ceil pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces soluble HLA molecules, wherein the 
soluble HU\ molecules are folded naturally and are trafficked 
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through the cell in such a way that they are identical in functional 
properties to an HLA molecule expressed from the HLA allele 
mRNA and thereby bind peptide ligands in an identical manner as 
full-length, cell-surface-expressed HLA molecules. 

15. The method according to claim 14, further comprising the step of 
harvesting the soluble HLA molecules from the cell pharm. 

16. The method according to claim 14 wherein the soluble HLA molecules are 
Class I HLA molecules or Class II HLA molecules. 

17. The method according to claim 14 wherein, in the step of isolating HLA 
allele mRNA from a source, the source is selected from the group 
consisting of mammalian DNA and an immortalized cell line. 

18. The method according to claim 14 wherein, in the step of amplifying the 
allelic cDNA by PCR, the at least one locus-specific primer is a 3 1 primer 
having a stop codon incorporated therein. 

19. The method according to claim 14 wherein, in the step of amplifying the 
allelic cDNA by PCR, the locus-specific primer includes a sequence 
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encoding a tail such that the soluble HLA molecule encoded by the 
truncated PCR product contains a tail attached thereto that facilitates in 
purification of the soluble HLA molecules produced therefrom. 

20. The method according to claim 14 wherein, in the step of inserting the 
truncated PCR product into a mammalian expression vector, the 
mammalian expression vector contains a promoter that facilitates 
increased expression of the truncated PCR product. 



21. The method according to claim 14 wherein, in the step of electroporating 
the plasmid containing the truncated PCR product into at least one 
suitable host cell, the suitable host cell lacks expression of Class I HLA 
molecules. 

22. A method for the production of soluble HLA molecules in a cell pharm 
having an appropriate growth media therein, comprising the steps of: 

obtaining gDNA which encodes a HLA allele; 

amplifying the allelic gDNA by PCR, wherein the amplification 
utilizes at least one locus-specific primer having a stop codon 
incorporated into a 3' primer thereby resulting in a truncated PCR 
product having the coding regions encoding cytoplasmic and 
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transmembrane domains of the allelic gDNA removed such that the 
truncated PCR product has a coding region encoding a soluble HLA 
molecule; 

inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product 
having the coding region encoding the soluble HLA molecule; 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating the cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces soluble HLA molecules. 

23. The method according to claim 22, further comprising the step of 
harvesting the soluble HLA molecules from the cell pharm. 

24. The method according to claim 22, wherein the soluble HLA molecules are 
Class I HLA molecules or Class II HLA molecules. 

25. The method according to claim 22, wherein, in the step of obtaining 
gDNA which encodes a HLA allele, the gDNA is obtained from blood, 
saliva, hair, semen, or sweat. 



224 

SUBSTITUTE SHEET (RULE 26) 



WO 02/062846 



PCT/US01/49744 



The method according to claim 22, wherein, in the step of inserting the 
truncated PCR product into a mammalian expression vector, the 
mammalian expression vector contains a promoter that facilitates 
increased expression of the truncated PCR product. 

The method according to claim 22, wherein, in the step of electroporating 
the plasmid containing the truncated PCR product into at least one 
suitable host cell, the suitable host cell lacks expression of Class I HLA 
molecules. 

A method for the production of soluble HLA molecules in a cell pharm 
having an appropriate growth media therein, comprising the steps of: 
obtaining gDNA encoding a HLA allele; 

isolating HLA allele mRNA from gDNA and reverse transcribing the 
mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA by PCR, wherein the amplification 
utilizes at least one locus-specific primer that truncates the allelic 
cDNA, thereby resulting in a truncated PCR product having the 
coding regions encoding cytoplasmic and transmembrane domains 
of the allelic cDNA removed such that the truncated PCR product 
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has a coding region encoding a soluble HLA molecule; 
inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product 
having the coding region encoding a soluble HLA molecule; 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating the cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces soluble HLA molecules. 

29. The method according to claim 28, further comprising the step of 
harvesting the soluble HLA molecules from the cell pharm. 

30. The method according to claim 28, wherein the soluble HLA molecules are 
Class I HLA molecules or Class II HLA molecules. 

31. The method according to claim 28, wherein in the step of obtaining gDNA 
which encodes a HLA allele, the gDNA is obtained from blood, saliva, hair, 
semen, or sweat. 

32. The method according to claim 28, wherein, in the step of amplifying the 
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allelic cDNA by PCR, the locus-specific primer includes a sequence 
encoding a tail such that the soluble HLA molecule encoded by the 
truncated PCR product contains a tail attached thereto that facilitates in 
purification of the soluble HLA molecules produced therefrom. 

33. The method according to claim 28, wherein in the step of inserting the 
truncated PCR product into a mammalian expression vector, the 
mammalian expression vector contains a promoter that facilitates 
increased expression of the truncated PCR product. 

34. The method according to claim 28, wherein in the step of electroporating 
the plasmid containing the truncated PCR product into at least one 
suitable host cell, the suitable host cell lacks expression of Class I HLA 
molecules. 



35. A method for the production of soluble HLA molecules in a cell pharm 
having an appropriate growth media therein, comprising the steps of: 
obtaining gDNA encoding a HLA allele; 

isolating HLA allele mRNA from gDNA and reverse transcribing the 
mRIMA to obtain allelic cDNA; 

amplifying the allelic cDNA by PCR, wherein the amplification 
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utilizes at least one locus-specific primer that truncates the allelic 
cDNA, thereby resulting in a truncated PCR product having the 
coding regions encoding cytoplasmic and transmembrane domains 
of the allelic cDNA removed such that the truncated PCR product 
has a coding region encoding a soluble HLA molecule; 
inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product 
having the coding region encoding a soluble HLA molecule; 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating the cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces soluble HLA molecules, wherein the 
soluble HLA molecules are folded naturally and are trafficked 
through the cell in such a way that they are identical in functional 
properties to an HLA molecule expressed from the HLA allele 
mRIMA and thereby bind peptide ligands in an identical manner as 
full-length, cell-surface-expressed HLA molecules. 

. The method according to claim 35, further comprising the step of 
harvesting the soluble HLA molecules from the cell pharm. 
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37. The method according to claim 35, wherein the soluble HLA molecules are 
Class I HLA molecules or Class II HLA molecules. 

38. The method according to claim 35, wherein in the step of obtaining gDNA 
which encodes a HLA allele, the gDNA is obtained from blood, saliva, hair, 
semen, or sweat. 

39. The method according to claim 35, wherein in the step of amplifying the 
allelic cDNA by PCR, the at least one locus-specific primer is a 3' primer 
having a stop codon incorporated therein. 

40. The method according to claim 14 wherein, in the step of amplifying the 
allelic cDNA by PCR, the locus-specific primer includes a sequence 
encoding a tail such that the soluble HLA molecule encoded by the 
truncated PCR product contains a tail attached thereto that facilitates in 
purification of the soluble HLA molecules produced therefrom, 

41. The method according to claim 14 wherein, in the step of inserting the 
truncated PCR product into a mammalian expression vector, the 
mammalian expression vector contains a promoter that facilitates 

t 
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increased expression of the truncated PCR product. 

42. The method according to claim 14 wherein, in the step of el.ectroporating 
the plasmid containing the truncated PCR product into at least one 
suitable host cell, the suitable host cell lacks expression of Class I HLA 
molecules. 

43. A multimeric HLA complex, comprising: 

a substrate; and 

at least two soluble HLA molecules attached to the substrate. 

44. The multimeric HLA complex of claim 43, wherein the multimeric HLA 
complex is used to test functionality of peptide ligands bound by the at 
least two soluble HLA molecules. 

45. A method for producing HLA molecules, comprising the steps of: 

isolating HLA allele mRNA from a source and reverse transcribing 
the mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA, wherein the amplification utilizes at 
least one locus-specific primer having a stop codon incorporated 
into a 3' primer thereby resulting in a truncated product; 
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inserting the truncated product into a vector to form a plasmid 
containing the truncated product; 

electroporating the plasmid containing the truncated product into 
at least one suitable host cell; and 

inoculating a cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated product such that 
the cell pharm produces HLA molecules. 

46. A method for producing HLA molecules, comprising the steps of: 

isolating HLA allele mRNA from a source and reverse transcribing 
the mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA, wherein the amplification utilizes at 
least one locus-specific primer that truncates the allelic cDNA, 
thereby resulting in a truncated product; 

inserting the truncated product into a vector to form a plasmid 
containing the truncated product; 

electroporating the plasmid containing the truncated product into 
at least one suitable host cell; and 

inoculating a cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated product such that 
the cell pharm produces HLA molecules. 
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47. A method for producing HLA molecules, comprising the steps of: 

isolating HLA allele mRNA from a source and reverse transcribing 
the mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA by PCR, wherein the amplification 
utilizes at least one locus-specific primer that truncates the allelic 
cDNA, thereby resulting in a truncated PCR product; 
inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product; 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating a cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces HLA molecules. 

48. A method for the producing HLA molecules, comprising the steps of: 

obtaining gDNA which encodes a HLA allele; 
amplifying the allelic gDNA, wherein the amplification utilizes at 
least one locus-specific primer having a stop codon incorporated 
into a 3' primer thereby resulting in a truncated product; 
inserting the truncated product into a vector to form a plasmid 
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containing the truncated product; 

electroporating the plasmid containing the truncated product into 
at least one suitable host cell; and 

inoculating the eel! pharm with the at least one suitable host cell 
containing the plasmid containing the truncated product such that 
the cell pharm produces HLA molecufes. 



. A method for the producing HLA molecules, comprising the steps of: 
obtaining gDNA encoding a HLA allele; 

isolating HLA allele mRNA from gDNA and reverse transcribing the 
mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA, wherein the amplification utilizes at 
least one locus-specific primer that truncates the allelic cDNA, 
thereby resulting in a truncated product; 

inserting the truncated product into a vector to form a plasmid 
containing the truncated product; 

electroporating the plasmid containing the truncated product into 
at least one suitable host cell; and 

Inoculating the cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated product such that 
the cell pharm produces HLA molecules. 
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50. A method for the producing HLA molecules, comprising the steps of: 
obtaining gDNA encoding a HLA allele; 

isolating HLA allele mRNA from gDNA and reverse transcribing the 
mRNA to obtain allelic cDNA; 

amplifying the allelic cDNA by PGR, wherein the amplification 
utilizes at least one locus-specific primer that truncates the allelic 
cDNA, thereby resulting in a truncated PGR product; 
inserting the truncated PCR product into a mammalian expression 
vector to form a plasmid containing the truncated PCR product; 
electroporating the plasmid containing the truncated PCR product 
into at least one suitable host cell; and 

inoculating a cell pharm with the at least one suitable host cell 
containing the plasmid containing the truncated PCR product such 
that the cell pharm produces HLA molecules. 



51, An HLA complex, comprising: 
a substrate; and 

at least one HLA molecule attached to the substrate. 

i 
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